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SEPTEMBER, 1936 


BOUNDARY CONDITIONS IN THE COMMON-FACTOR-SPACE, 
IN THE FACTORIAL ANALYSIS OF ABILITY 


by 
GODFREY H. THOMSON, 


Moray House, University of Edinburgh. 


The author arrives at a simple rule for ascertaining when a 
matrix of correlations, with communalities reducing it to minimum 
rank, cannot be analyzed into factors such that every column of 
loadings has at least as many zeros as the number of common 
factors, as required by Thurstone. A more exact but arithmetically 
tedious rule is also deduced from Ridley Thompson’s boundary con- 
ditions, and a correction is made to the latter. 


I. Object of the present paper. 


In his latest method of factorial analysis, which is so clearly and 
completely described in The Vectors of Mind, Professor Thurstone 
finds those “communalities” which, when inserted in the diagonal cells 
of a sufficiently large* matrix of correlations, reduce its rank to the 
lowest possible number. He shows that the tests which give the cor- 
relations can be expressed in terms of common factors equal in num- 
ber to that minimum rank, together with specific factors whose “‘load- 
ings” or coefficients are the square roots of the defects of those “com- 
munalities” from unity. Thurstone gives a beautifully simple method 
of arriving at one set of the loadings, in each test, of this small num- 
ber of common factors. These loadings which are first obtained in- 
clude negative values, and Thurstone proceeds to “rotate the axes” of 
the common factors until there are no negative loadings, and until a 
large number of them are zero. He lays it down (on pages 151 and 
156) that (1) each row of the matrix of loadings must have at 
least one zero, that is each of the tests must lack one at least of the 
factors; (2) that each column must have at least as many zeros as 
there are factors, that is there is to be no general factor, for every 
factor is to be absent in several tests; and (3) that in each pair of 
columns there must be at Jeast as many tests, whose loadings vanish 
in the one but not in the other column, as there are factors in the 
analysis. The object of the present paper is to enquire under what 
circumstances Thurstone can succeed in attaining the second of these 


*See Thurstone’s Table 2, page 77 of The Vectors of Mind. 
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three desiderata. In accordance with Thurstone’s notation I shall 
use x for the number of tests, and 7 both for the rank of the matrix, 
and for the number of factors. The symbol r is not to be confused 
with a correlation coefficient, from which it is distinguished by the 
absence of suffixes. Further, I shall call a factor which has loadings 
(other than zero) in two tests, a 2-factor; one which has loadings 
(other than zero) in three tests, a 3-factor; and so on up to a general 
factor which has loadings in every test, i.e. an n-factor. The question 
to be answered is therefore, under what circumstances can a set of n 
tests be expressed in terms of r common factors (plus of course n 
specific or 1-factors) none of which has more than (n—,r) loadings 
(leaving at least r zeros) ? Note that this question applies to the com- 
mon-factor-space, that is, to a space of 7 dimensions (where r is the 
minimum rank of the correlation matrix) which is at right angles to 
all the specifics. 


II. J. Ridley Thompson’s Boundary Conditions. 


Now a similar question, referring to the whole of space of which 
the n test-vectors define a subspace, has already been partly answered, 
and my task is therefore that of adapting that answer to the limita- 
tions of the common-factor-space. In the year 1919 I asked myself 
what values the three correlations of three tests must have if it were 
to be impossible to express these tests without a general factor run- 
ning through all three of them, and I noted that the answer was, ap- 
proximately, that a general factor was essential if the correlations of 
the three tests averaged more than 0.5, or those of tests averaged 
more than (7—2)/(n—1). I suggested the problem as a piece of re- 
search to Mr. J. Ridley Thompson, now of Sheffield University, and he 
gave an answer, in determinantal form, for three tests.* Later, in a 
series of papers, he gradually generalized the theorem, his final form 
(which is in fact not quite correct) being the following statement.7 


*See appendix by J. Ridley Thompson to my paper “The Proof or Disproof 
of the Existence of General Ability” Brit. Journ. of Psychol. 1919, IX, 335, and 
see also page 142 of The Essentials of Mental Measurement by Brown and Thom- 
son. 

+“The General Expression for Boundary Conditions and the Limits of Cor- 
relation”, by J. Ridley Thompson, Proc. Roy. Soc. Edinburgh, 1928-9, Vol. XLIX, 
pp. 65-71. In passing it may be pointed out that in lines 12 to 18 of his page 69, 
Ridley Thompson came very near anticipating Thurstone’s fundamental propo- 
sition on rank and factors. 
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If A be the determinant 


(1) 


A = (—1)” 113 23 k % T3n | ; 


v; n T2 n rs n : k 








then as k is given the values —1, —2, —3 --- and so on, (always the 
same value, be it noted, along the whole diagonal) the sign of 4 will 
change from minus to plus at the point where the higher order fac- 
tors are unnecessary, as shown in this table: — 








Table I. 
k If A is minus If A is plus 
—!1 3-factors or higher fac-| 3-factors (and higher factors) 
tors are necessary. are unnecessary. 
—2 4-factors or higher fac-| 4-factors (and higher factors) 
tors are necesary. are unnecessary. 
—(n—2) n-factors are necessary.| n-factors are unnecessary. 








The probable error of A has been given by Dr. T. P. Black* to whom 
also the determinantal form of the final theorem is due, for curiously 
enough, though J. Ridley Thompson gave his first result in that 
form, he did not do so in the general case. 

I may point out here that Ridley Thompson’s is really a theorem 
concerning the positions of the latent roots of the correlation matrix 
R, that is the roots of the characteristic equation 


|R—AI|=0 


and that Ridley Thompson’s diagonal quantity k is equivalent to 
1 — 4A. When I realized this, towards the end of 1935, I thought at 
first that Ridley Thompson’s theorem must imply that a correlational 
matrix cannot have more than one latent root greater than 2, (cor- 
responding to his k — —1), for if, for example, 4-factors and higher 
factors are unnecessary, then the sign of A should not again change. 

*Mental Measurement: The Probable Error of Some Boundary Conditions 


in Diagnosing the Presence of Group and General Factors,” Proc. Roy. Soc. Edin- 
burgh 1928-9, Vol. XLIX, pp. 72-77. 
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But Dr. Aitken, to whom I mentioned this, replied by constructing a 
correlation matrix which did possess two roots greater than 2, and 
Mr. Walter Ledermann has since made several such for me. Clearly 
therefore Ridley Thompson’s theorem as he stated it is incomplete. 
The sign of A may change several times, and it is only the last of 
these changes to which his theorem refers and then only as a neces- 
sary, and not a sufficient condition. This may be expressed by saying 
that if 4 is the largest latent root, then (s+-1) factors or higher fac- 
tors are necessary (where s-+-1 is the integer immediately above 4). 
Mr. Walter Ledermann is at present inquiring into the whole connec- 
tion of factor-extent and latent-roots.* 

This difficulty in the application of Ridley Thompson’s theorem— 
the difficulty, that is, of being certain that when the sign changes to 
plus it will remain plus — makes the simple approximate method now 
to be described more useful, for this difficulty does not arise there. 


III. Approximate Solution of the same Problem. 


The approximate solution of the general factor problem which 1 
gave above, namely that when the average of the correlations exceeds 
(n—2) /(n—1) a general-factor is essential, can be readily extended 
to cover all the boundary conditions for n variables. It is simply that 
when the average of the correlations exceeds (s—-1) /(n—1), s being 
a positive integer, an (s-+1)-factor or a higher factor is essential. 
This expression is derived by considering the case where there are 
only s-factors, including all possible s-factors, and where all loadings 
are equal. In that case all the correlations are alike, and are equal to 
(s—1)/(n—1), and this is the highest average amount of correla- 
tion that can be produced without going higher than s-factors. The 
method of using this rule therefore is to calculate the average of all 
the correlation coefficients, and to note where this average value fits 
in, in the series of fractions 


1 == 
td bel ek _— 





Thus if n is 21 (say), and the average correlation coefficient is .632, 
this value comes between 12/20ths and 13/20ths 


12 13 
30 < .6382 < 50 


and this shows that at least one 14-factor or higher factor is necessary. 
Both Ridley Thompson’s exact theorem, and my approximate substi- 


*See his paper immediately following this article. 
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tute for it, when applied to the measured correlations, are concerned 
with all possible analyses of the tests, and are not confined to the 
common-factor-space. 


IV. Application of Boundary Conditions to Corre‘ations 
Corrected for Communality. 


In order to apply these theorems to the common-factor-space it 
is necessary to “correct” the correlations first for communality. Each 
test-vector can be divided into two components, one (its specific) along 
an axis at right angles to the common-factor-space, the other in the 
common-factor-space. {t is the latter with which we are to deal, and 
the cosines of the angles between them are given by 


Vij 
ss ieee - (2) 
where 7;; is the correlation coefficient, and h;? and h;? are the two com- 
munalities. These latter quantities 7’;; therefore it is into whose mat- 


rix the diagonal elements k — —1, —2, etc. have to be inserted in ap- 
plying the boundary conditions to the common-factor-space only :— 





k Min M33 * Min 
112 k 03 . Ton 


piuakind | 113 1o3 k -  M3n (3) 
| : 
| 





Nin Yon Yan : k 


It is however unnecessary to calculate the quantities 7’;;.. Instead we 
note that 








| kh? 112 Fig * Bip 
1 Tye Kha? 23 + Ton 
Ac + (—1) n 
oe | Tis %23, Shs? + Pan (4) 
| Tin Ten fo « BS 


Since the communalities h;? are all positive quantities, the sign of A’ 
depends upon the sign of the determinant on the right. The rule there- 
fore is to insert into the diagonal cells of the ordinary correlation mat- 
rix k times the communality, and examine the sign of the resulting de- 
terminant multiplied by (—1)*. The rotation desired by Thurstone, 
giving r zeros in each column of loadings, is impossible, if the sign 
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be minus for k — —(n—r—1), for (n—r-+1)-factors or higher fac- 
tors will then be essential. 

Similarly, the approximate method of section III can be applied, 
the only change being that it is the average of the “corrected” corre- 
lation coefficients (“corrected” for communality) which is to be used. 
A numerical] illustration will illuminate both methods. 

Before turning to the numerical illustration I should however 
refer to one point regarding the application of these theorems, or at 
least of my own, to the common-factor-space. I arrived at my theorem 
by the consideration that, when every s-factor is used, the maximum 
attainable correlation is (s — 1)/(m — 1). In the common-factor- 
space however not every s-factor can be used, for the number of col- 
umns of loadings is fixed by the rank. This imposes a new condition, 
and will make it still more difficult to attain a high average correla- 
tion without the use of an (s+-1)-factor. It may be expected there- 
fore that an (s-++-1)-factor will be doubly essential if the average cor- 
relation exceeds (s—1) /(n—1). Light is thrown on this by Mr. Led- 
ermann’s paper. 


V. Arithmetical Illustration: Approximate Method. 
Consider the following matrix of correlation coefficients 


1 2 3 4 5 6 7 8 9 





| 
| 


89 44 AT 40 48 56 40 30 35 
44 25 24 15 18 21 32 24 .28 
AT 24 25 20 24 28 24 18 21 
40 15 20 89 46 83 o2 56 24 
48 18 24 46 40 54 08 14 .06 
56 21 28 83 54 85 24 42 18 
40 32 24 32 08 24 80 76 68 
30 24 18 56 14 42 -76 85 63 
515) 28 21 24 .06 18 68 63 8 


eonananrisoartk WwW NO 





which can be reduced to rank 3 by the communalities shown. Let us 
inquire, first of all by the approximate method, whether after analysis 
into three common factors, there is the possibility that the axes can 
be rotated in the common-factor-space until there are three zeros in 














GODFREY H. THOMSON 161 


every column. The 36 correlations corrected for communality have to 
be averaged, which can most readily be done by deleting the com- 
munalities in the matrix, dividing each column by the root of that 
column’s communality, adding the rows, and dividing each total by 
the root of that row’s communality. The grand total must then be 
divided by 72. Without troubling to keep many decimals (for the 
method is in any case only approximate) we obtain .585 for the aver- 
age of the corrected correlations. In the fraction (s—1)/(n—-1) the 
denominator equals 8, and .585 comes between 4/8ths and 5/8ths. The 
numerator s—1 is therefore 4, and so s is equal to 5, that is 6-factors 
or higher factors are essential. This leaves the possibility open that 
three zeros may occur in each column. The actual analysis, after ro- 
tation, is:— 


FACTOR LOADINGS 
II III Communality 

89 
25 
25 
89 
40 
85 
80 
85 
58 


Test 


p= 
wb wooo 


NYNawrnodcocowy»inpn eu 


I 
8 
3 
4 
5 
6 
7 
0 
0 
0 


Co oON Do BP ~- DH 
on & & 


If however the correlations in the last row and column of the 
matrix, instead of being as shown, had been 


75 43 .41 49 36 53 .68 .63 


the communality of the ninth test, to give rank 3, would have been 
.83. If for this modified matrix we find the average of the correlations 
corrected for communality we obtain .633, which comes between 
5/8ths and 6/8ths so that s—1 — 5, and s is 6, that is a 7-factor or 
higher factor is essential. In at least one column therefore the zero- 
loadings must be less than three. The actual analysis and rotation 


gives:— 
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FACTOR LOADINGS 
II III Communality 

89 
25 
25 
.89 
40 
85 
80 
.85 
83 


Test 


as 
mt tb 


SON fF & 
orXNPRODObw DBD Ooo 


 cCaonisew fF wn 
Ynnodcicdwnpian 


on 


It is impossible to obtain three zeros in every column with this battery. 


VI. Arithmetical Illustration: Exact Method. 


If three or more zeros are to be possible in each column of load- 
ings, the determinant of equation (4) multiplied by (—1)”, must be 
positive for k = —(n—1—1) = —(9—3—1) —= —5. The arithmeti- 
cal task therefore is to evaluate the determinant with —5 times the 
communalities in the diagonal. This is not a formidable task if Ait- 
ken’s method of pivotal condensation* is used, but the comparative 
ease of the approximate method makes it unlikely that this exact 
method will be employed except for theoretical inquiries. Mr. Ian 
Wishart has performed the calculations for me and finds. 


fork—=—5, A’=—+ 1705.5 ~h,?--- hg? 
fork——4, A’—— 816.0 ~h,?---h,? . 


A 6-factor or higher factor is therefore essential, as we found with 
the approximate method. When the last row and column of the matrix 
are altered as before to the higher correlations of the alternative test 
9, we find that the value of A’ changes, for k = —5, to —115.5 + 
h,? «++ hy”, so that, with these correlations, at least one 7-factor is nec- 


*“Note on the Computation of Determinants”, by A. C. Aitken, Transactions 
of the Faculty of Actuaries (Edinburgh) 1931, Vol. XIII, pp. 12-15. See also my 
paper “Some Points of Mathematical Technique in the Factorial Analysis of 
Ability”, Journ. of Educ. Psychol. January 1936, pp. 45-6. 
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essary, and Thurstone’s requirement of three zeros per column cer- 
tainly cannot be obtained. 


VII. Summary. 


The paper inquires into the necessary conditions under which a 
matrix of correlations of minimal rank 7 can be analyzed into r col- 
umn factors with at least r zeros in every column of loadings. An 
approximate but sufficiently exact answer is given, which involves no 
heavy calculations, namely that if the average of the correlations “cor- 
rected” for communality exceeds (s—1) /(n—1), where n is the num- 
ber of tests and s is a positive integer, then s-factors are insufficient, 
an s-factor being one which has non-vanishing loadings in s of the 
tests, leaving (n—s) zeros. A more stringent condition is also ar- 
rived at, by applying Ridley Thompson’s “boundary conditions” to 
the matrix with correlation coefficients corrected for communality 
and with various negative integers in the diagonal. 

In passing it is pointed out that Ridley Thompson’s theorem is 
not completely stated by him, inasmuch as the sign of the determinant 
occurring in it may change several times in special cases. The last 
change is that to which his remarks apply, that is, it is the largest 
latent root of the matrix which is significant in this connection. 


specs ETT CL ie a 


CE. aera ae 














PSYCHOMETRIKA—VOL. 1, NO. 3 
SEPTEMBER, 1936 


SOME MATHEMATICAL REMARKS CONCERNING BOUNDARY 
CONDITIONS IN THE FACTORIAL ANALYSIS OF ABILITY. 


WALTER LEDERMANN, 
Moray House, University of Edinburgh. 


This paper is a mathematical supplement to the preceding paper 
by Professcr Godfrey H. Thomson. It gives rigorous proofs of theor- 
rems enunciated by him and by Dr. J. Ridley Thompson, and ex- 
tends them. Its basic theorem is that if a matrix of correlations is 
to be factorized without the aid of higher factors than s-factors 
(with n—s zero loadings), then the largest latent root of the matrix 
must not exceed the sum of the s largest communalities on the 
diagonal. 


I. 


This paper is intended as a mathematical supplement to the pre- 
ceding article* on “Boundary Conditions in the Common-factor-space, 
in the Factorial Analysis of Ability” by Professor G. H. Thomson. It 
was pointed out there (“B” II), that some difficulties arise when the 
boundary conditions first established by Mr. J. Ridley Thompson (see 
“B”, footnotes where references may be found) are applied in their 
original form. It seemed therefore desirable to revise Mr. J. R. 
Thompson’s work and to restate his results in a more mathematical 
and rigorous form. I would like to express my thanks to Professor 
G. H. Thomson for suggesting to me this problem as a piece of mathe- 
matical research. 


II. 


Both Mr. J. R. Thompson’s boundary condition and Professor 
Thomson’s criterion which is a less stringent but for practical pur- 
poses more useful substitute for it, can be based on the following 
theorem} concerning the largest latent root of certain matrices. 
If Aisa real matrix with n rows and m columns, 


ay yo *** Aim 
A a Ag, Ago +++ Aom 
|Gn1 Uno*** Gin 


*Henceforth referred to as “B”’. 
+Proved by the writer in a paper read at a meeting of the Edinburgh Mathe- 
matical Society on June 6th, 1986 and to be published later. 
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which has not more than s non-vanishing entries in each column and 
whose rows have norms 
h?,, h2, eo “* h?,, , i.e, 
Li . 
> a, = h*,; (t=1,2,---,) ’ 


k=1 


then the largest latent root of the (non-negative, symmetric) matrix 


R=AA’ 
does not exceed the quantity, 
oh? +h? +--+h, (1) 
where W? h? eee, a are the s largest norms. 


If, in particular, the rows of A be normalized, i.e. 
At, on f*, == ---o= A, = 1 ’ 


then h?, + h?,, +---+ 2’, =s is an upper limit for the latent roots 
of R. 


III. 


The application of this theorem to Factorial Analysis is obvious. 
We identify R with the matrix of correlations and A with the matrix 
of loadings. If A contains the common-factors as well as the specific 
factors (see “B” I), then the rows of A will be normalized and it fol- 
lows that all latent roots of R must be less than or equal to s, the 
maximum number of non-vanishing entries in each of the columns of 
A. On the other hand, when we confine ourselves to the common-fac- 
tor-space, the number of columns in A will be v and the norms of the 
rows of A are the “communalities” 


h?, , h?,,---, h?, ’ 


which are less than 1. Hence the upper limit o for the latent roots 
of R is in this case less than s. If w be the largest latent root of R, i.e. 
the largest root of the*determinantal equation 


A(d4) = (—)"|R—AI|=0, (2) 
then 
A(4) >0, 


when y 


A>m@.. 
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Ridley Thompson’s boundary condition may therefore be enunciated 
as follows (c.f. “B” II and IV): 

If a matrix R of correlations is to be factorized without the aid 
of higher factors than s-factors (0 = s < n), then the latent roots of 
R must not exceed the quantity 


oh? +h fe the, , 


where h?, ho ‘++ h?, are the s largest diagonal entries of R. 
or: 

If a matrix R of correlations is to be factorized without the aid 
of higher factors than s-factors, then the determinant A(A) should 
be positive when 2 > 0. 


Stating the theorem in yet another form we may say (c.f. “B”): 

If u is an integer and if the largest latent root of R exceeds u, 
or if A(u) < 0, then u-factors are certainly insufficient to account for 
the given correlations and factors with more than wu non-vanishiny 
loadings are needed. 

I would like to point out that this boundary condition—and still 
more so the following criteria—are necessarily fulfilled when s-fac- 
tors are sufficient to explain the observed correlations. But it has not 
been proved that, conversely, a factorial matrix can actually be found 
which contains only s-factors or lower factors, provided these con- 
ditions are satisfied. 


IV. 


We shall now deduce Professor Thomson’s criterion (“B” III) 
from Ridley Thompson’s boundary condition. Let w, as before, be the 
largest latent root of R; as is shown in the theory of quadratic 
forms,* w may be defined as the maximum of the quadratic form 


£R2’=> rir Ui Ue (1,kh=1,2,---,”) , (3) 
i,k 
when x ranges over all unit (row) -vectors 
X= [%1, %o,°*+,2n] , i.e. when 
Saz=1. (4) 


a 


Hence for every such vector we have 


Dd Vix Vi % Sw ° (5) 


1 


t 


*See e.g. H. W. Turnbull and A. C. Aitken, An Introduction to the Theory of 
Canonical Matrices, p. 170. (Glasgow, 1932). 
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In particular, if we take the unit vector 





ee ee eee , i.e. preity: 
n Vn 
we obtain by (5): 
1 
n = Vik = WM: (6) 
We now introduce the average correlation 
2> Tix 
— n cot 
r= > ra/( ) a (7) 
ick : n(n—1) 
and the average communality 
a h?, h2 me ‘ 
Rt sa RN . (8) 
n 
Thus we can write the left hand member of (6): 
1 Reni - 
= ao si nad [(h?, + h?, +--+ h?,) +3 riz] 
i,k ixk 
1 
-— [(h?,+h?,+---+h?,) + 2D rx » 
; ick 
since 
Tik=Tki 5 
and substituting (7) and (8): 
1 fin 7” 
—> rx —h? + (n—1)rSo.- 
Nik 
Hence 
ne w — h? 7 
"=sy—I1 (9) 


If we consider the total factor-space, we have to replace all commun- 
alities by unity and hence also 


he=1. 
Further, by §II, the number s is in this case an upper limit for w. We 
therefore get by (9): 


s—l1 


n—1 (10) 


r 


IIA 
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i) 
o—* aw (tae 
qenonaty 9th +-1)-factor 
or higher factor is essential for the analysis of the observed correla- 
tions. 


(See “B” --- ). Consequently, if 7 exceeds 





¥: 


A theorem by G. Frobenius. In order to apply Ridley Thomp- 
son’s boundary condition it is necessary to find an integer which ap- 
proximates to the largest root of the equation 


A(a) = (—)"|R—Aal|=|aAI—R|=0. (11) 


Let w be the largest root of equation (11). If u > w, then A(u) > 0, 
for A(A) does not change its sign after { has passed w, and A(i) > 0 
for sufficiently large values of 4. On the other hand, if A(u) < 0, then 
u is certainly less than ow. 

In most practical cases, the correlations i.e. the elements of the 
matrix R are positive numbers and general theorems which deal with 
those matrices become applicable. 

Professor O. Perron has proved* that the largest latent root of 
a matrix with positive elements is always a simple root so that the 
sign of 4(4) changes as 4 passes from values less than w to values 
greater than w. 

In many cases, especially when the correlation coefficients do not 
greatly differ, the following theorem by G. Frobenius} will be found 
useful. ‘| 

Let R = [rix] be the matrix of correlations in which the diagonal 
entries may either be the communalities or all equal to unity. Denote 
by 71, 12, *** , % the sums of the elements in the first, second, --- nth 
column (row). The largest latent root then lies between r,, and 7»: 


tm SoS , (12) 


where 7, is the least and 7, the greatest of those sums. 

To illustrate this theorem we shall use the same example that was 
given in “B’’, §V. In order to apply Ridley Thompson’s boundary con- 
ditions we have to estimate the largest latent root of the matrix (“B” 
SIV, (2) ) 


Vij 


alee 


]; (13) 





*See O. Perron, Algebra Bd. II, p. 87 (Berlin 1933). 
+G. Frobenius: ttber Matrizen aus positiven Elementen. S. B. preuss. Akad. 
Wiss 1908, pp. 471-476. 
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where the values of 7;;(7 + 7) and h?; — r;; are given in the table on 
p. 160. The characteristic determinant of R* then becomes: 


1—A .94 1.00 46 .81 .65 .48 .385 .49 
941—2 .96 32 .57 .46 .72 .52 .74 
(1.00 .96 1—A .43 .76 .61 .54 .389 .55 
| 46 32 481—A .78 .96 .38 .65 .34 
A*(4) = (—)*® |} 81 57 .76 .781—A 93 .15 .24 .12 
65 46 .61 .96 .931—A .29 .50 .26 
48 .72 .54 88 .15 .291—A .92 .91 
35 52 389 65 .24 50 .921—/ .89 
| 49 .74 .55 34 .12 .26 91 .891—A 


For the sums of the elements in the columns of R* we obtain: 
7, ==6.18 5; 7°,>-6238 ; 7°,—-624 3; 7.56.82 >; 755.56 ; 
r°.=<=5.66 ; 7°,=5.89 ; 1°,=5.44 ; 1r*,=5.30. 


















































































By Frobenius’s theorem the largest latent root therefore lies between 
5.30 and 6.24. This is in accordance with the fact (“B’”’ VI) that 
A*(A)is positive for 1 = 6(k = —5) and negative for 4 = 5(k = —4). 


VI. 
Special interest is attached to the case in which 


m=r and s=n—,r , 


where r is the rank of R and also the number of factors i.e. the num- 
ber of columns in the factorial matrix A. The discussion of this case 
is the main object of Professor Thomson’s paper. If we put s = n—/ 
in (1), (9) or (10), we obtain condiiions which must be fulfilled 
when the matrix of loadings is to contain not more than n—* non- 
vanishing entries, or at least 7 zero entries, in each column. 

It is perhaps of some theoretical interest to have further neces- 
sary conditions which must be satisfied in this particular case. In 
chapter VI of his book “The Vectors of Mind”, Professor Thurstone 
discusses the problem of finding a suitable set of coordinate axes with 
respect to which the matrix of loadings assumes a simple shape. The 
most general factorial matrix can be written in the form 


BAL , 


where A is a special factorial matrix and L any square matrix of or- 
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der r whose determinant does not vanish and whose columns are nor- 
malized (see Thurstone, loc. cit. p. 154 where our B, A, L, are denoted 
by V, F, G@). It is desired to determine L in such a way that B has at 
least r zeros in each of its columns. What conditions must A or AA’ 
= R fulfill if that transformation is to be possible? 





Let 
Ay Ayo +++ Ay | Ay, 
| 
as er Oar +a |_| Os (14) 
. . . . | oes 
Any Ano +++ Unr | | Qn 
where 


a, = [Qy1,Qye, ists Avr] (z — 1, 2, eile r) 


is a row-vector with 7 elements. Similarly, let 
2 


L= [l,, L,-:- my , 
where 


l= {liz lozy sss Lz} (e == 1, 2,--- 1) 


is a column-vector with 7 elements. We can then write 
Bu Diss: Dir (a, l,) (a, lL.) il (a, l,) 
pp [ban Ban -+- Bar |__| (Qa 4) (Ga be) +++ (a) 





, (15) 


Dna Due eee Our | (Qn, l,) (Gy, 1.) ig (Qn L,) 





where 


Dix = (A, 1b.) = DX i; Lin ((—=1,2,--.,”; &==1,2,---,¢) (16) 


j=1 


is the inner product of the vectors a; and ,. Suppose now that the 
factorial matrix B is of the required form. Then r entries in the first 
column of the right hand member of (15) should vanish. Let these 
be the entries: 


(a,,4) = (@,,h) == (ah) =O <a<is<a). (17) 
We can condense (14) in one matrix equation: 


(18) 





A,l,=0 ’ 
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where 

| ] 

a, | er Rs iu 

| @ | a a. ++ @& 

Av= % | = | a21 a22 aor | (19) 
| | ed ee 2 Pie Pie ’ 

a | 
awe | a, 1 %a,2 a, r | 


This is one of the (%) minors of order 7 that can be formed from the 


given matrix A. Since l, + 0, it follows that the determinant of A 
vanishes: 


|Ae|=0. (20) 
In analogy to (18) we get for the second column of B: 
Agl,=0 , (21) 


where Ag is likewise a minor of A of order 7, not necessarily distinct 
from A.. Again, we must have 


|Ag|=0. 
By repeating this argument for all columns of L we obtain 7 minors 
Ac, Ag-+++:- jis (22) 


each being of zero determinant. We shall first assume that all these 
minors are formally distinct. 

In order to express our result in a concise form we say (follow- 
ing Sylvester) that a square matrix is of “nullity” » if its order ex- 
ceeds its rank by 7; hence we have 


nullity = order minus rank . (23) 


Every matrix of non-vanishing determinant is of nullity zero, irres- 
spective of its order. If the determinant of a matrix vanishes, its 
nullity is necessarily positive (= 1). The nullity is, of course, never 
negative. Each of the minors (22) is of nullity = 1, while the re- 
maining () — yr minors are of nullity = 0. We may therefore say 


that the sum of the nullities of all minors of order r is at least r. This 
statement still holds when some of the minors (19) are identical. For 
suppose that 

Aac=Ag , 
i.e. that A. and Ag are obtained by selecting the same rows from 
A. By (18) and (21) we have then 


A,l,= ; Aal=0. 
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Since L is of non-zero determinant, /, and /. are linearly independent, 
and it follows that the set of homogeneous equations 


Acx=0 


has (at least) two linearly independent solutions. According to well 
known theorems* concerning linear equations this implies that Aa 
cannot be of rank higher than 7 — 2, or that the nullity of Ac is at 
least 2. Thus the sum of the nullities of Aa and Ag is again = 2 and 
our statement obviously holds for any number of repetitions in the 
set (22). 

It is possible to apply our test directly to the given matrix R with- 
out the aid of a provisional factorial matrix A. For by definition we 


have 
A A’= R= [rix] ’ 


where A is defined in (14). By actually performing the matrix mul- 
tiplication we obtain 


Tix = (A; A) on Qj; , (t,K=1,2,---,n) . (24) 
j=l 


Similarly, it is readily proved that 
(a, 4) (@,, %,,) ++ (@,, @,,)| 


a a a ar 


Ag A’, = (a, a,,) (a, a,,) ae (@,, @,,) — Ra, Say, 





(a, a.) (a, @,,) pe (a, a.) 


where Az is defined in (19). Substituting (24), we see that R. isa 
principal minor of order 7 formed from R by selecting the rows and 


columns with suffixes a,, a2,-:- a,, thus 
Yr Y 7 | 
a,a,; ay a: Q1a,r | | 
R. 4 oom lass a2” . thks 2a Bisa 
| 
Tr 7 | 
ara, Gra. arar 


Now, it is known (see Thurstone, loc. cit. p. 72) that Ra and Aa are 
of the same rank and hence also of the same nullity. We have there- 
fore proved the following theorem: 

If it is to be possible to factorize a correlational matrix R of rank r 


*See e.g. H. W. Turnbull, The Theory of Determinants, Matrices and In- 
variants, London and Glasgow, (1929), p. 75. 
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in such a way that the matrix of loadings contains at least r zeros in 
each of its columns, then the sum of the nullities of the (%) r-rowed 
“ 


principal minors of R must at least be equal to r. 

Consequently, if the sum of those nullities does not reach the 
value 7, the factorisation required will be impossible. Since the rank 
of R is vr, some of the principal r-minors do certainly not vanish (cf. 
Thurstone, loc. cit. p. 91). But it may very well occur and, indeed, 
will in general be the case that none of those minors vanishes, or that 
they are all of nullity zero. It will then be impossible to find a matrix 
L with the desired properties. 

A little consideration will show that the last criterion is more 
stringent than the other two given above which are, however, not 
restricted to the case s = n — r. For it may happen that the first 
two criteria still leave open the question whether a transforming 
matrix L can be found while this possibility is excluded by the last 
criterion. 
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THE FACTORIAL ISOLATION OF PRIMARY ABILITIES 


L. L. THURSTONE 


This is an experimental study of the isolation, by factor meth- 
ods, of primary abilities from a battery of tests given to 240 stu- 
dents. The range and nature of the fifty-six tests is briefly de- 
scribed. Tentative interpretations of the twelve orthogonal primary 
factors are given. 


This is a brief summary of an experimental study of human abil- 
ities. The plan of the experiment was to cover a wide range of the 
paper-pencil tests that are in current use for appraising intelligence. 
The test battery was assembled so as to include, abstraction, space, 
visualizing, form, number, reasoning, and verbal material. If primary 
abilities are present in these tests, the factorial methods should be 
adequate for their isolation since the battery was fairly extensive, in- 
cluding fifty-six tests. 

The complete battery required fifteen hours of testing time. The 
subjects were 240 student volunteers at the University of Chicago. 
All the subjects who were included in the factorial analysis completed 
the whole battery. Only 9% of the volunteers failed to finish all the 
tests. Most of the subjects were freshmen and sophomores at the Uni- 
versity of Chicago, but the experimental group was not strictly con- 
fined to them. A few older subjects were allowed to participate. Each 
of the volunteer subjects who completed the whole battery was given 
a report on his performance on each of the tests in comparison with 
the rest of the group. In making these preliminary reports the tests 
were grouped as well as could be done in the present state of knowl- 
edge without factorial analysis by the constituent elements. 

The nature of the tests will be described here with a brief men- 
tion of each form. Most of the tests were designed in three parts, 
namely, the instructions with examples, a fore-exercise, and the test 
proper. The fore-exercise was not necessary for a few of the tests, 
such as spelling and arithmetic, in which the nature of the task was 
readily understood from general instructions. The number in paren- 
theses after each test is the code number by which the test is identified 
throughout the analysis. 
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List of Tests in the Battery 


Matched Proverbs (4) for similar meaning, 


Matched Quotations (5), 
These two tests were designed by the writer at Carnegie In- 
stitute of Technology and they have been used by Professor 
Thorndike in the CAVD tests. 


Verbal Classification (6) was intended to be a verbal parallel to 
Spearman’s 

Figure Classification (8), 

Word Grouping (7) in which the subject selects the odd word in 
a group of five words, 

Controlled Association (9) in which the subject is asked to write 
as many words as he can recall whose meaning is similar to 
a given word, 

Inventive Opposites (10) and Inventive Synonyms (16) in which 
the subject is asked to recall and to write the response words, 

Completion (11) is similar to the test with the same name in the 
Psychological Examination of the American Council on Edu- 
cation. 

Disarranged Words (12) in which the subject rearranges pied 
letters of a word, 

First and Last Letter (13) in which the subject writes as many 
words as he can recall that begin and terminate with speci- 
fied letters, 

Disarranged Sentences (14) was adapted from the Army Alpha 
test, 

Anagrams (15) is similar to other tests by the same name, 

Block Counting (17) is adapted from a test by MacQuarrie, 

Cubes (18) is adapted from a test by Brigham, 

Lozenges A (19), Lozenges B (22), and Flags (20) were de- 
signed by the writer some years ago as tests of space think- 
ing, 

Form Board (21) is the Minnesota Paper Form Board which 
was here used with permission of the authors, 

Surface Development (23) is a new test of space thinking in 
which the subject is asked to make comparisons between a 
flat cardboard or sheet metal form and the sketch of the ob- 
ject into which the sheet can be folded, 

Punched Holes (24) is a development of the well known test, 
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Mechanical Movements (25) is an assembly of pictures of me- 
chanical movements in which the subject is asked to specify 
the movement of certain parts when that of other parts is 
given, 


Identical Forms (26) in which the subject must find one of five 
designs which is identical with a given design, 


Pursuit (27) and Copying (28) are adapted from two tests by 
MacQuarrie, 

Areas (29) is adapted from a test by Brigham, 

Number Code (30) is a new number test which involves the use 
of the Mayan number code, 

Addition (31), Subtraction (32), Multiplication (33), Division 
(34) are simple speed tests of arithmetical computation, 

Tabular Completion (35), to fill in the missing entries of a table 
by inspecting the other entries and the column headings of 
the table, 

Estimating (36) is a modified form of a test of the same name 
which was used in the 1924 Psychological Examination of 
the American Council of Education, 

Number Series (87) is a well known form of test, 

Numerical Judgment (38), to select one of four given numerical 
answers as the correct one without carrying out the complete 
computations. In most of the problems the answer can be 
given by noting the order of magnitude of the numbers so 
that detailed computation is superfluous, 

Arithmetical Reasoning (39) is a test in current use, 

Reasoning (40) contains a series of syllogisms and False Prem- 
ises (42) is a syllogistic test in which all of the terms are 
meaningful words which, however, do not make meaningful 
premises or conclusions. The subject must ascertain wheth- 
er the conclusion follows from the premises although both 
are apparently nonsensical statements, 

Verbal Analogies (41) is similar to the well known form, 

Code Words (48) is an extension of a reasoning test by Godfrey 
Thomson, 

Pattern Analogies (44) is similar to one of the tests in the Psy- 
chological Examination of the American Council on Educa- 
tion, . 

Syllogisms (45) is characterized by monotony of content, 
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Rote Learning I (46), to memorize paired associates. The stim- 
uli are words and the responses are two digit numbers, 
Rote Learning III (47) consists of memorizing paired associates 

with names as stimuli and initials as responses, 

Rote Learning II (48) consists of memorizing paired associates 
with two digit numbers as stimuli and other two digit num- 
bers as responses, 

Word Recognition (49) is a recognition memory test for individ- 
ual words with specially selected lists of distractors, 

Figure Recognition (50) is a recognition memory test for simple 
geometrical figures with similiar distractors, 

Picture Recall (51) calls for memory of the detail of a picture, 

Theme (52) is a short theme in which the subjects were asked to 
describe an acquaintance. The themes were rated by English 
instructors, 

Hands (53), to indicate for each of a large number of pictures 
of hands whether it is a right or a left hand, 

Rhythm (54) is a test with groups of four lines of poetry. The 
subject is asked to select the one line whose rhythm is dif- 
ferent from the other three, 

Sound Grouping (55), to indicate which one of each group of 
five words sounds different from the other four, 

Spelling (56), 

Grammar (57), 

Two Vocabulary tests (58 and 60) are well known forms. In 
the Grammar test the subject was asked to correct faulty 
sentences by altering only one word, 

Free Writing (59) was merely the word count in the theme. 


Most of the frequency distributions of raw scores were clearly 
unimodal and approximately symmetric. In the present study it is 
assumed that whatever the primary abilities may be they are nor- 
mally distributed in the experimental population. If this assumption 
is true then any linear combination of the primary abilities must also 
be normally distributed. With this assumption a deviation from nor- 
mality in the distribution of raw scores is regarded as an arbitrary 
effect of the degree of difficulty of the items. This assumption seems 
to be the safest that we can make about these psychological functions. 
It is certainly less arbitrary than to assume that the distribution of 
raw scores is itself significant since we know that such distributions 
can be altered at will by changing the proportions of items of differ- 
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ent degrees of difficulty. The ideal procedure would therefore seem to 
be to normalize all of the distributions before entering them into cor- 
relation tables. The most complete procedure would be to compute 
Pearson product-moment coefficients. The assumption of the nor- 
mality of the underlying primary distributions and the resulting nor- 
mality of their linear combinations fits exactly the assumption that 
underlies the tetrachoric correlation coefficient. It is, in fact, an 
estimate of the Pearson product-moment coefficient for pairs of nor- 
mally distributed variates. Tetrachoric correlation coefficients were 
used in this investigation. They were computed by dividing each of 
the variates at the median, at which the determination is more accu- 
rate than when the division is made near either tail of the distribu- 
tion. The tetrachoric correlation coefficients were determined by fa- 
cilitating tables* by which four determinations of the coefficient can 
be made in a few minutes after the appropriate four-fold tables have 
been assembled from a Hollerith tabulator. This procedure sacrifices 
accuracy in that the probable error of a tetrachoric correlation is 
larger than the probable error of a Pearson product moment coeffi- 
cient. However, this procedure is superior to that of computing Pear- 
son product-moment coefficients for the raw scores in case some of 
the distributions deviate noticeably from normality. This work is 
largely a matter of computational routine. 

The correlational matrix was of order 57x57 and it was factored 
to twelve dimensions by the centroid method.} 

By far the largest part of the labor is in the rotation of a twelve 
dimensional configuration so as to bring it into a positive orthogonal 
manifold. This work has been done by several analytical methods and 
by graphical methods of determining orthogonal transformations. The 
graphical methods have been in use for several years and a complete 
example will probably be published in a later issue of Psychometrika. 

The factorial analysis of a correlational matrix consists essen- 
tially of two parts. The first problem is to factor the correlational 
matrix. In the present case the factorial matrix is of order 57x12. 
The factoring can be done routinely by the centroid method or by any 
other method which does not disturb the minimum rank of the cor- 
relational matrix with unknown diagonals. The second major part of 
the problem is to rotate the configuration into a unique position with 
reference to the coordinate axes so that the axes shall be scientifically 


*Chesire, Saffir, Thurstone, Computing Diagrams for the Tetrachoric Cor- 
relation Coefficient, (The University of Chicago Bookstore, 1933). 

+L. L. Thurstone, The Vectors of Mind (Chicago: The University of Chicago 
Press, 1935), Chapter III. 








180 PSYCHOMETRIKA 


meaningful. The unique solution is attained for psychological abilities 
when the factorial matrix has been so rotated that the number of zero 
entries is maximized and so that the non-vanishing entries are posi- 
tive. An arbitrary factorial matrix cannot be rotated so that a large 
proportion of the entries become vanishingly small and so that the 
non-vanishing entries are positive. When this can be done the test 
battery has revealed an underlying order. It is the object of factorial 
analysis to discover that order. 

When the factorial matrix of the present problem was rotated 
into a positive orthogonal manifold, the entries in the columns of the 
factorial matrix were studied to obtain a psychological interpretation 
of each factor. Seven of the primary abilities seem to have fairly clear 
psychological meaning. Two others can be given a tentative psycho- 
logical interpretation. One of the factors seems to be an error factor 
and two of the factors do not seem to be sufficiently conspicuous in 
the present battery to allow clear interpretation. 

Perhaps the most conspicuous of the factors is the one which has 
strong factor loadings on all of the tests (30-39) inclusive except 
(36). The highest factor loadings are on (30), (31), (32), and (33). 
A glance at the corresponding tests reveals that this is the whole num- 
ber group. All of the tests without exception that are high on this fac- 
tor are number tests. The simplest number tests have the highest fac- 
tor loadings in this factor. The more complex number tests, namely, 
(35-39) inclusive have relatively lower factor loadings on this factor. 
We seem justified in denoting this factor N and if we should name it 
we should naturally call it a number factor. At this point it may be 
well to call attention to the fact that the naming of a factor is always 
necessarily limited to the available tests which represent it. It may 
happen that this factor which we have denoted N or number might 
transcend those functions which we ordinarily refer to as numerical. 
If we should discover some other tests which also have high loadings - 
on this factor and which are non-numerical in character, then our 
psychological interpretation of this factor would have to be extended 
to some more fundamental function which would still include numeri- 
cal operations as good examples but which would not necessarily be 
limited to numerical operations. 

The next conspicuous factor has a heavy loading in the following 
tests: (18), (20), (22), (23), and (27) and it has marked factor 
loadings also in (8), (17), (19), (21), (45), and (53). If we intro- 
spect about the mental activities that are involved in doing these tests, 
it seems clear that the common element is visualizing. Flat space and 
solid space are both included in this factor. There seems justification, 
therefore, for denoting this factor V or visualizing. 
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Another factor which seems to be rather easily interpreted psy- 
chologically has large factor loadings in tests (46-48) inclusive and 
appreciable factor loadings in (49-51) inclusive. These are the six 
memory tests in the present battery. It is of some interest to note 
that all six of these tests combine into a single memory factor. Al- 
though two of them involve numbers they do not appear in the num- 
ber factor. However, work with specially designed batteries is indi- 
cated to ascertain whether memory can be split into several distinct 
memory factors. This factor has been denoted M and it might be 
named memory in terms of the tests by which it is identified. 

Another factor has large loadings for tests (12), (15), (56), and 
(57) and appreciable factor loadings in (10), (13), (16), (49), (55), 
and (60). In all of these tests without exception the subject deals 
with single words. None of these tests is characterized by sustained 
verbal thinking. It has therefore been denoted W, in view of the fact 
that another verbal factor is also found to be distinct from this word 
factor. It might be named word facility or word fluency. 

Another verbal factor has high factor loadings in tests (4), (5), 
(7), (9), (10), (16), (41), (57) and it has appreciable loadings in 
(13), (14), (40), (42), (55), (58), and (60). All of these tests are 
characterized by the fact that they involve word meanings or word re- 
lations. It seems to be more logical in character than the factor whose 
chief examples are Disarranged Words and Anagrams. Since it is 
distinctly verbal in character it has been denoted W,. and it might be 
named verbal relations to distinguish it from the less logical factor 
that is implied in W, It would be interesting to investigate these two 
factors experimentally in relation to some forms of aphasia. It is like- 
ly that experimental studies of abnormalities may enable us to clarify 
the psychological nature of these factors more definitely than experi- 
ments on normal subjects. 

A more restricted factor has high loadings in (6), (7), (26), and 
(51), the highest loading being for Identical Forms. Considerable 
study of the tests that have high and appreciable loadings on this fac- 
tor seems to indicate the psychological interpretation that it involves 
facility in a type of perceptual discrimination. A special test battery 
is now being arranged to study this factor in further detail. It seems 
to be the factor by which some individuals can scan a field of percep- 
tual detail to identify readily what they are looking for. The factor is 
definitely perceptual in character and it is not limited to sensory 
acuity. It has been denoted P and it has been tentatively called per- 
ceptual speed although this name is cee to revision with further 
experimental study. 
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Perhaps one of the most interesting factors is that which is 
markedly present in tests (8), (29), (35), and (37). The latter is a 
Number Series test and (8) is Spearman’s Figure Classification. In 
all of these tests the subject is asked to find some rule or principle by 
which given perceptual material is unified. Because of this inductive 
characteristic the factor has been denoted J and it has been tentatively 
named induction. It calls for further experimental study with test 
batteries specially designed to feature induction. 

The appearance of induction as one of the primaries makes one 
immediately wonder whether deduction can be found in one of the 
other factors. This seems to be indicated in the fact that one of the 
factors has conspicuous loadings in Reasoning and in False Premises 
and appreciable loadings in Mechanical Movements and in Spearman’s 
Figure Classification test. All four of these tests have in common the 
characteristic that certain premises are given and that one of several 
given inferences must be selected as correct. This factor has there- 
fore been denoted D and it has been tentatively named deduction. If 
Reasoning appears with at least these two factors it would seem to 
follow that some individuals might be superior in deductive reasoning 
without being superior in induction and vice versa. This might be 
psychologically defensible. 

One of the remaining factors is heavily loaded with Arithmetical 
Reasoning, Arithmetical Judgment, Vocabulary tests, Mechanical 
Movements, Estimating, Reasoning, and False Premises. This factor 
is more difficult to name but it may be characterized by some elements 
of precision. 

All of these primary factors are uncorrelated in the experimen- 
tal population. 

A four hour test program involving these primaries is now being 
assembled for use during Freshman Week at the University of Chi- 
cago. We expect to rate each of the students separately on each of 
these primary abilities. 

Before we shall be certain of the psychological characterization 
of these primary factors it will be necessary to study experimentally 
specially designed test batteries in which each of these primaries is 
featured in turn. Such studies will eventually refine our knowledge 
of the exact nature of each of the primary mental abilities. 
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THE METHOD OF MINIMUM VARIATION FOR THE 
COMBINATION OF CRITERIA* 


HAROLD A. EDGERTON AND LAVERNE E. KOLBE 


Occupational Research Program U. S. Employment Service 
Washington, D. C. 


The problem of weighting separate criterion variates is solved 
by minimizing the differences among the standard scores of the in- 
dividual upon the various measures. The method is compared with 
Horst’s procedure of maximizing the inter-individual differences. 
An application is made to personnel data. 


How to combine several criterion variates, estimates of the de- 
gree of excellence of performance of a task, into a single composite 
has had a number of solutions — all of which are necessarily con- 
cerned with the determination of the proper weights to assign the 
criterion variates. These include weighting the several variates pro- 
portionally to their importances as judged by “experts,” weighting 
proportional to the reliability of the variate, weighting proportional 
to the average of the correlations of the one variate with the remain- 
ing variates, weighting the criterion variates so as to obtain a maxi- 
mum correlation with several independent or predicter variates, ex- 
tracting by factor analysis those parts of the criterion variates which 
may be ascribed to the same factor, or weighting the criterion vari- 
ates so that the discrimination between all possible pairs of individ- 
uals in the sample will be as great as possible. 

The list of methods is not all inclusive, but is presented to sug- 
gest some of the ways in which the problem has been met. 

Any method for combining several criterion variates into a sin- 
gle composite assumes that they are measures of the same thing or 
various aspects of the same thing. From this assumption it follows 
that one should expect the differences among the variate values for 
any individual, expressed as standard scores, to be small. This forms 
the basis for the method of minimum variation for combining several 
criterion variates into a single composite. 


*This technique was developed in connection with a study of the prediction 
of success of salespersons, which is being carried on by the Occupational Research 
Program of the United States Employment Service. 
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Suppose that there are given n estimates of competence in the 
performance of a task for each of N individuals, each estimate being 
expressed as a standard score z; whose weight a; is to be determined. 

Let it be assumed that the best linear combination of these n 
criterion variables is that one which makes the sum of squares of the 
n(n—1) /2 differences of the standard scores for any individual a 
minimum. 

Let, 


F (ai) = [ (a2, nZo)® +. (AyZ, — Az2%3)? + +++ (Qn-rZn-1 — AnZn)?] 





(1) 
where the a’s are connected by the relation, 
H(a;) =a?+a7+-.-+14a/7/=h , k~0 (2) 
The function to be minimized is then 
S=F(a;) + BH (ai) (3) 


where # is a Lagrange multiplier. 
The first partial derivatives of S with respect to the independent 
variables @,, dz, +++ , Gn are set equal to zero: 











6s " 
oe (n—1-+) 0,21? — 0y21%. — M32 123 — +++ — An 21%, = 0 
g | 
pore (4) 
6S , 
eae (N—1-LB) AnZn? — 1212 — U2 2% n — ++ — On-12n-12n = 0 
n | 


Summing these derivatives for the N individuals, n normal 
equations are obtained. E.g., the first equation of (4) becomes 


“s = (n—1+-B) G21? — G22 212%2 — ++» — On de2i= 0 (5) 
Since ¥z;* = N and Sziz; = N7v;;, the normal equations may be 
written 
6S 
2 PA oa, == (n—1+£) Ay — 1122, — ++ — Vi cy1yn-a — V1 nh, = 0 
+ (6) 
— . a,—Y?. ay — + **—P (n-1) nna + (N—1-++ 8) = 0 
N— oa, in“1 2n ’ sd 
J 
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In order that these n equations hold simultaneously the determi- 
nant of the a’s must equal zero. 


| m—1+f8 —riz rot re —l in 
| ounsaffl” n— oe Tt —ls 
A (a) == | Tie 1+5 . 2(n-1) a = 0 
| (7) 
| 
, , , LB | 
| —Tin ern aa n—1-+-6 | 





A(a) vanishes for each of the x real negative roots of 


Cob" + Cpr + Cop? + C=O , (8) 


the expansion of 4(a). 

The coefficients of 6 in (8) are obtained by evaluating the prin- 
cipal minors of 4(a):C, = 1, C, equals the sum of the first order 
principal minors, C, the sum of the second order principal minors, 
--», C, the sum of the nth order principal minor. 

If C, equals zero, one value of f is zero, showing that the normal 
equations were consistent and could be solved immediately for the 
weights a;. If C, does not equal zero, (8) must be solved for the root 
nearest to zero (thus making a minimum correction to (6)). This 
value of 6 should be substituted in (7) and the determinant proved 
to equal zero, before substituting it in (6). 

Since there are n equations with n unknowns and all the con- 
stants are zero, only the ratios of the weights can be obtained. By 
arbitrarily giving any weight a definite value the other weights are 


/ te 
fixed by the ratios obtained. E. g., the ratios =, oe : 


n nN 





, can be 


computed by dividing each of the normal equations (6) by a, and 
solving (n—1) of them. The ratios will satisfy the nth equation also. 
The best estimate of an individual’s excellence of performance, 
which is the linear combination of the n estimates weighted so that 
the sums of squares of the differences among all pairs of them is a 
minimum, may now be written 
C= 0421 + MeZe - +++ + AnZn ’ 


or in terms of the original units, X,,---, Xn, 
a. a. Ay a; 
T = —X,+—Xe+---+—Xn+ (Mo—=T— Mi) - 
O71 02 on Cj 


Example: The correlations among five variates: cost per cent, num- 
ber of sales per day, total net sales per day, net dollars per sale, and 
ratio of actual credits to total net sales; taken from a department 


“ie 
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store study of the Occupational Research Program are here shown 
substituted in (7). 


| 448 .049  .160 —.026 —.098 | 
| 049 448 —436 .637_—_.480 | 
| 160 —.436 446 —289 .151 | =0 
| 026 .687 —.289 446 —.408 | (9) 
—.098 480 .151 —.408 44, | 
Expanded, this becomes 
6° + 206* + 158.8627f° + 626.2180/? + 1224.45718 + 949.5588 — 0 . 


The root nearest zero is found to be 6 = —2.9272. Substituting this 
ay A, as a, 
—,—,—, and — 
a; Gs Os a; 
are found to be .2305, —1.2132, —.3991, and .9987 respectively. If 
a; be arbitrarily assigned the weight of unity, the composite score for 
any individual in the sample is 
== .2305z, — 1.2182z, — .8991z, + .9987z,-++ Zz; . 

Horst* developed a method for combining criterion variables so 
that the standard deviation of the composite scores, or the discrimina- 
tion between all possible pairs of individuals in the sample, shall be 
a maximum. His normal equations were written, 


(1—y)a, ae T 2M a pee a indy = 0 ’ 


value of 6 in the normal equations (6) the ratios, 


Pinch: saa sy ee BE oe hs ee (10) 
Tn Qy + Tonle + Pre + (1—y) dn =O. 
where y is the variance of the composite scores for the group. 

It may be seen that if (10) be multiplied through by minus one, 
the equations are similar to (6), and y — n+. The ratios of the 
weights obtained by Horst’s method and the method of Minimum 
Variation are identical. 

The above example was also worked by Horst’s method. The de- 
terminant expanded was 


— yp + dy — 8.8627 y? + 6.7220y? — 1.9775y + .1107=0 . 


The largest of the five real positive roots is y = 2.0728. 
Testing both functions by means of the second partial deriva- 
tives, it is seen that in the method of Minimum Variation $ must be 


*Horst, Paul. Obtaining a Composite Measure from a Number of Different 
Measures of the Same Attribute. Psychometrika, 1936, 1, 53-60. 
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< n—1 for a minimum, and in Horst’s method y must be > 1 fora 
maximum. The values of # and y obtained in the example satisfy 
these inequalities. 

Since the two methods result in identical sets of weights, it might 
be interesting to compare them on other bases. Horst’s function was 
derived so that the composite would show a maximum differentiation 
among individuals. This is statistically a desirable condition. The 
method of Minimum Variation makes the assumption that the differ- 
ences among the n standard scores for an individual are a minimum. 
Apparently, the two approaches, with considerable difference in the 
logic of their derivations, are complementary. 

From the computational point of view it is simpler to find the 
negative root nearest zero for the value of § in (8) than to locate all 
the positive roots for y in Horst’s equation to be sure the largest was 
obtained. Horst gave an approximation method for finding the 
weights when is large: 


Q, = CDSriy Ap = CDi, 9 +++ On = CS7i, - 
The approximate weights obtained for the above example, letting 
C = om » were 1.0457, .3086, 1.6160, 1.2411, and 1, showing no 
i5 


agreement with the weights obtained by the exact solution. 


A 
a 
ie 
a 4 
3 
bea 
Ww ’ 
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THE CONTENT RELIABILITY OF A TEST 


HAROLD GULLIKSEN 
University of Chicago 


. The content unreliability of an essay test is the error due to the 
items used or the content of the test. The reader unreliability is due 
to vaciation in judgment of the persons who read and score the essay 
test. The content reliability of an essay test is accordingly defined 
as being independent of the reader reliability. Formulae are derived 
for the reader reliability and for the content reliability. The content 
reliability is found to be equal to the geometric mean of the test re- 
liabilities computed from the scores assigned by the two readers, 
divided by the reader reliability. 


In comparing the reliability of an objective test with that of an 
essay test, the interpretation of the results is complicated by the fact 
that there is one source of error in essay tests which does not appear 
in objective tests, namely, the unreliability due to variation in judg- 
ment of the reader. For the purpose of analysis, it may be said that 
the unreliability of an objective test is due only to the content unre- 
liability, i.e., the error due to the items used or content of the test, 
while the unreliability of an essay test is due to two factors, content 
unreliability and reader unreliability. 

Questions such as the following arise in comparing essay and 
objective test material. If an objective test with a reliability of .90 
were changed into essay form, and could be graded with a reader re- 
liability of .80, what is the expected reliability of the essay test? If 
an essay and objective test are being compared and it is found that 
the essay test has a reliability of .60 and a reader reliability of .80, 
while the reliability of the objective test is .90, can it be said that the 
difference in reliability is due solely to the variability of reading? A 
further analysis of the concept of reliability is needed in order to 
answer such questions. 

For the purpose of analyzing and comparing the reliabilities of 
an essay and an objective test, it is necessary to distinguish three dif- 
ferent types of reliability: 


total test reliability or observed reliability, obtained by correlat- 
ing two comparable forms of the test; 


reader reliability, obtained by correlating the scores assigned by 
one reader with those assigned by another on the same ma- 
terial ; 
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content reliability, which is the same as test reliability in the case 
of an objective test but must be determined by further anal- 
ysis in case of an essay test. 

The test reliability is usually defined as the correlation between 
two comparable forms. However, in the usual case, two comparable 
forms are not available so the reliability is determined by dividing 
the examination into halves, correlating one with the other and cor- 
recting for length by means of the Spearman-Brown formula. The 
reader reliability, on the other hand, is computed without the use of 
the Spearman-Brown formula. This fact, together with the desirabil- 
ity of allowing for the possibility that the error of one reader may be 
greater than that of another, makes it desirable to derive the content 
reliability, using test reliability computed by the split-half method, 
and correcting for length. 

Let the two halves of the test be designated by the subscripts a 
and b and let the scores assigned by the two readers be designated by 
by ’and”. If we now regard any given test score as the sum of three 
quantities, 


t, the true score, 
c, the error due to the test content, and 


e, the error due to the variable judgment of the reader, 
we have the following definitions: 


Definitions 
wa=t+C+ea ;» 
wo=tt+a+e%., , 
ry=t+toate’, , 
zy%=tto+e”,. 


For convenience we may use y to designate the test score as freed 


from reader error, 
Ys=t+a , 


Y=t+o, 
and designate the total score assigned by a reader as follows: 
w= 2g + HX'y , 
xe Ht UM" 


These symbols all represent deviation scores. 
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Certain assumptions made during the derivation will be set down 
here for completeness. 


Assumptions 
a) All intercorrelations between t, ¢,, Cs, €’a, €’a, €’s, and e”, 


are zero. . 
b) Sec?, = Sc*,. These will both be designated by Sc? 


(without a subscript). 
c) Similarly Se’.2 = Se’,?, both designated by Se”, and 
d) Se”.? = Se”,?, both designated by Se” . 
e) Se” is in general different from Se” . 


Assumption a) states that the errors are independent of each 
other and of the true score. Assumptions b), c), and d) state that 
the error variance in one-half of the test is equal to the error variance 
in the other half. This is a part of what is involved in saying that 
the two halves are comparable. Assumption e) means that one reader 
may be much more accurate than the other. 

The quantity to be solved for is the content reliability which will 
be designated by 7,. Applying the Spearman-Brown formula gives 





21 y.» 
=> . 1 
1 + Ty cur ( ) 
Then from the definitions we have 
= (b+- a) (E+ ep) (2) 








Ty av, = 
i VSCEF €:)? S (t+ 6)? 
Expanding and using assumptions a) and b), we get 


ae 
Tyavr = SP - Xe? ¥ (3) 
Substituting N times the variance for the sum of the deviations 
squared and dividing both the numerator and denominator by N, we 
get 


eee: ae (4) 
Yao —— C?; + C2, 


Substituting (4) in (1) and simplifying, we get 


2c?; 
"e8e +e, o 
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This is the content reliability. The problem is to express it in terms 
of the reader reliability and total test reliability. 
The reader reliability (7,) is given by 


i= yore . (6) 
From the definitions it can be seen that 
= (2¢ + Ca + Cy ++ Ca + e’y) (2¢ + Ca + Cy + ea + e”s) 


= e 
V>(2t + Ca + Cy + C’a + €%)* J (2E+ Cg + Cp + 0% + Cy)? 
(7) 








Expanding and applying assumptions a) to e), inclusive, we get 
— AS + 230? 
"  V 4358 + 2352+ 25e”) (458 + 2c + 25) 


Substituting N times the variance for the sum of the deviations 
squared and dividing numerator and denominator by 2N, we get 


a 2c?; + c?, 
V (2c?, +- c?, + c?,,) (2c?, + c?, + c?,,) 


from which we have 


207; + Cer rr V (2c? ++ ©? + c?,,) (2c?, + c? + c.,) (10) 


The total test reliability, computed from scores assigned by read- 
er 1, will be designated by 7,, and is given by the formula 
gf on 
jo a : (11) 
, from the definitions is 
. re (t+ Ca + e’a) (t+ Cy + ©’) 
Wer VEE Ca + Cc)? S (EF Cy + e's) 


Expanding and applying the assumptions a) to e), inclusive, we get 


(8) 








Tr 








(9) 





7 


The value of 7 


2’ gt’ 


(12) 








! Bt 

"eae SEL Se Se 
Substituting N times the variance for the sum of the deviations 
squared and dividing numerator and denominator by N, we have 





(13) 


c- 
c= "i. (14) 
Lal's e, + . + ae 


Substituting (14) in (11) and simplifying, we get 

















HAROLD GULLIKSEN 193 
2c*, 
vor — A 29 2 ‘4 
2c? , “— ail" ss Ce 


Similarly, the reliability of the test as scored by reader 2 will be rep- 
resented by 7,,,. This is given by 





(15) 





D9p2 
Ce .: (16) 
ow 8 He pe, 


To allow for possibility that one reader may read more accurately 
than another it will be assumed that c*,, ~ c?,, so that 7,, and 1,, 
need not be identical. From (15) and (16) we have 


2¢?, = VT 4 Nn V (2c?, + c?, + c7,) (2c?, + c?,-+ c2.,) - 





(17) 
Substituting (17) and (10) in (5) and simplifying give 
It? 
Y= Viet ow P (18) 
T, 


in which 7, is the content reliability, 
r,, is the test reliability computed from the scores of reader 
1 by the split-half method corrected for length by the 
Spearman-Brown formula, 


r., is the test reliability similarly computed from the scores 


assigned by reader 2, and 
7, is the reader reliability obtained by correlating the total 
test scores assigned by one reader with those assigned by 
another. 
The resemblance between formula (18) and the correction for 


attenuation can easily be seen. 
If the two readers are equally accurate, 


T,=1,=f, . (19) 
Substituting (19) in (18) gives 


tenet , (20) 
in which r, is the content reliability, 
r, is the test reliability, and 
r, is the reader reliability. 
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Formula (20) will now be used to answer the illustrative ques- 
tions raised at the beginning of the paper. If an objective test with a 
reliability of .90 were changed into an essay form which could be 
graded with a reader reliability of .80, how high would we expect the 
observed reliability of the essay test to be? Since the observed and 
content reliability are the same in the case of the objective test, we 
know that its content reliability is .90. If the content reliability of 
the essay test were the same (.90) and its reader reliability .80, these 
values may be substituted in formula (20), giving 

To 

.90 — — , 
from which 

7, equals .72 . 


Therefore it can be said that the reader reliability of .80 would be ex- 
pected to lower the observed reliability of the essay examination to .72. 

At no point in this derivation is it assumed that the content re- 
liability of a test is independent of a change in the form (e. g. objec- 
tive to essay) of a test. The formula given enables one to correct for 
reader error and so find out whether or not the content reliability has 
been altered by a change in the form of the test. 
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THE FOUNDATIONS OF PSYCHOMETRY: FOUR 
FACTOR SYSTEMS. 


WM. STEPHENSON 


(From the Psychology Laboratory, University of London). 


Four methods of factorizing the fundamental matrices used in 
factor analysis are described and illustrated. The first is repre- 
sented by the techniques already developed. The second is the ob- 
verse factor technique. The third and fourth methods are variants 
of the first and second. The implications of each method for differ- 
ent schools of psychology are pointed out. The methods are comple- 
mentary, not competitive. 


Contents: 
Introduction: 
The Four Systems. 
An Example. 
Two Methodological Approaches. 
Further Observations. 
Conclusion. 


Introduction. 


I am to draw attention to four different ways in which matrices 
of the kind used in factor analysis can be standardized and analyzed 
for factors. In the past only one of these four has been clearly dis- 
tinguished, namely, that for the factor technique of Spearman, Thur- 
stone, Kelley and others in which tests etc. are variables, and persons 
are the population. The second, the obverse or inversion of the first, 
was described recently ;* it employs persons as variables instead of 
tests or other single attributes. Two others, of minor immediate in- 
terest and subsidiary to the first pair, will be described in the sequel. 
The four together, I believe, supply the foundations for a considerable 
part of the future science of psychometry. 

In the present paper , after defining the systems and supplying 
an example of each, I proceed to examine the modes of thought that 
each can subserve. The four help us to see in perspective what, I be- 
lieve, has not been perceived clearly hitherto. Two of the four are 
statistical statements of a relativistic standpoint, and may be of help 
to gestalt theorists, whilst the other two are parallel foundations for 


*Stephenson, W. The Inverted Factor Technique. Brit. J. Psychol. XXVI, 
1936, p. 344. (Other papers are referred to in the sequel). 
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‘elemental’, process,* or ability psychology. Psychologists of the form- 
er school of thought have claimed that theirs supplies the only scien- 
tific procedure.} Psychologists of the latter school draw attention to 
the “‘irrelevancies of experience” in gestalten, and hold that it is pre- 
cisely characteristic of scientific methodology that it does not engulf 
the “total situation”.§ I shall not need to take sides in this contro- 
versy. Of the four systems two subserve the former viewpoint, and 
two the latter; but no one of the four systems, nor either of the psy- 
chological standpoints, is sufficient or adequate alone. The future 
psychometry cannot hope to be complete unless it uses all four sys- 
tems, each for ends which it is best fitted to serve. 


The Four Systems 


We are to be concerned with matrices of the following kind: Each 
is to contain persons in the row (persons A, B, C --- N) and items 
such as personality traits, tests, measurements or attributes in gen- 
eral, in the column (attributes a, b, c,--- 1). 





Persons 
= 2 eS pee | 
° | a 

8 a | me © aXpB aXe oe ee ee aAy 
$ b bX 4 rAB pXc e-° oe ee pA y 
~ 
4 a 
4 c cX A cAB eke a ee oe cAyn 
s 
~ 
<x 

n nXA nA B nX¢ oe ee o- nd&dy 








The variate of person A for item a, or of item a for person A, is 
represented by .X.4, and the same connotation is held throughout. The 
four systems or ways of treating such matrices are as follows:— 

Variates can be entered in the matrix for single attributes or 
tests etc. (such as a), for standardization with respect to the popu- 
lation of persons. This is what has been done, of course, in all pre- 
vious factor analysis. Data is standardized along the row, so that 
for each attribute or quality a to n respectively, the following holds: — 


*Line, W. Process-Psychology — Individual and General. Psychol. Rev. XL. 


p. 256 
yLewin, K. Dynamic Theory of Personality, Chap I. (McGraw-Hill) 1935. 
§Thurstone, L. L. The Vectors of Mind. p. 46. (Chicago Univ. Press )1935. 
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Dt ee Zt ee+ er 2 eo 1. (1) 


(where the summation is from x, to xy in each case). Rows are cor- 
related, and analyzed by way of factor theorems such as those of 
Spearman, Thurstone, Hotelling, Kelley, and others. 

Secondly, variates can be entered into the matrix for standardi- 
zation in columns, for a population of attributes, so that for each per- 
son the following holds:— 


2, ta Ge ag ee --- ee FO y* == I e (2) 


(where scores are summed in each column, from ,% to ,x). Correla- 
tions are calculated between columns, for persons or whole aspects of 
persons as variables, and factorized by way of inverted factor theo- 
rems, that is, obverses of the theorems used in system (1). 

Thirdly, variates which have been standardized as at (1) can be 
entered into the matrix, and thereupon re-standardized by way of 
system (2). That is, using y to stand for variates that have already 
been standardized in rows, the same variates can be given the fol- 
lowing standardization per column:— 


2 Veo Sve. = Fy? = (3) 


(where the summation is per column, from ,y to »y). Correlations are 
for columns, and these, as in system (2), are factorized by way of in- 
verted factor theorems. 

Fourthly, scores already standardized as at (2) can be entered 
into the matrix and used for re-standardization by way of system 
(1). Re-standardization is effected with respect to the population of 
persons, for traits now considered separately as variables, the crude 
variates for which were standard scores with respect to a column. 
Using z to stand for variates already standardized in columns, the 
following represents this system:— 


DoS et: = Dee"? — 1 (4) 


(where the summation is now per row, from 2, to zy). Correlations 
are between rows, and are factorized by way of the usual Spearman 
or other factor theorems in their initial form, as in the case of sys- 
tem (1). 

Systems (1) and (4) deal with separate* attributes as variables; 
(2) and (3) with persons. Factor measurements for an individual 
are regression estimates by way of systems (1) and (4); they are 


*The word does not imply that the attributes are distinct, unique, isolated or 
unitary, in the sense of not having correlation with any others. 
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dependent upon the statistics of a population of persons. Those for 
systems (2) and (3) are correlation coefficients (loadings, or satura- 
tion coefficients). The original variates might be identical for all 
four systems of standardization, in which case the factor systems is- 
suing from each will not be independent, although each has its own 
point of interest for psychometry. But the variates can also be quite 
independent for systems (1) and (2). Systems (3) and (4), on the 
other hand are not independent of (1) and (2) respectively.* 

Systems (1) and (4) are what I called 7-technique elsewhere, 
since they deal with the correlations (7) between tests or separate 
traits etc., and analysis is by way of direct factor theorems. Systems 
(2) and (3) have been called Q-technique, their concern being with 
correlations (Q) between persons or whole aspects of persons, ana- 
lyzed by inverted factor theorems. 


An Example. 


The following example will illustrate the above systems, and sup- 
ply at the same time a standard of reference by which to judge past 
and future investigations involving factor analysis. 

Suppose that we make 100 measurements (a, to @,o) for each 
of a number of persons A to N. Let the measurements all relate to 
physical features of the body, height (a,), arm-length (a.), leg-length 
(a;), trunk diameter (a,), neck-circumference (a;), finger length 
(a,), and so forth. For the purposes of a controlled study, let the 
persons be of one age, and all comparable in point of nourishment 
and freedom from obesity etc. We can say a great deal about the fac- 
tor systems for such measurements, purely on a priori grounds.+ 

First let the physical measurements be standardized in rows of 
the matrix of data (system (1)). That is, the measurements for 
height (a,) are reduced to standard scores, for the population of N 
persons, and similarly for each row separately. Such rows can be cor- 
related, and it is obvious that the correlations must often be positive, 
and that a factor or factors will emerge for some of the variables, 
because the person with the greatest height will also tend to have the 
longest arm-length etc. The interpretation to be given to such fac- 
tors will be discussed later. 

But the crude measurements can also be standardized directly in 
columns (system (2)). All the 100 measurements are added, per per- 

*The matrices can be built up in other ways: thus individuals might be meas- 
ured for their Spearman, g, p, 0, w, f, c, ete. in standard terms, and then cor- 
related amongst themselves with these factor estimates as population. 


+It seems worth while as an example to pursue an investigation for such 
measurements. I hope to publish such data in due course. 











WM. STEPHENSON 199 


son, the mean and standard deviation for each person is calculated, 
and his crude scores reduced to standard ones. Such measurements 
can be correlated,* for persons as variables and the 100 physical at- 
tributes as population. The measurements are all considered entirely 
with respect to a person, without any quantitative reference to any 
other person. Different units of length can be employed for each per- 
son without making the slightest difference to the correlations. But 
it is obvious that all persons will correlate positively with each other 
(unless they are grossly deformed); indeed, the correlations will 
often be nearly perfect. It may be held at once that a general factor 
will be found for such data, whatever other orthogonal or other com- 
mon factors may emerge. Again, however, I shall discuss what mean- 
ing to attach to such factors later. 

If the measurements are first standardized as in system (1), and 
then correlated as in system (2), we have an example of system (3). 
All the attributes are now given equal weight, so to speak, a nose be- 
ing no longer than a leg in standard terms. So far as any general fac- 
tor of system (1) is concerned, this system (3) will show no counter- 
part to it: but any factors that emerge should not be independent of 
the subsidiary ones coming from system (1). System (3), however, 
can contribute information not already supplied by the previous sys- 
tems; and it may often be convenient to use because its measurements 
for a person are factor saturations and not regression estimates. 

An example of system (4) is supplied if the crude measurements 
are first standardized as for system (2), and then correlated as in sys- 
tem (1). If there is only one general factor (plus specific factors) in 
system (2), then no factors should emerge from system (4). Other- 
wise factors will usually appear, and will correspond in some way 
with any subsidiary common factors of system (2). Again, however, 
it is well to realize that it might sometimes be highly pertinent and 
fitting to measure attributes primarily relative to an individual as in 
system (2), and thereupon seek to factorize the attributes as vari- 
ables with respect to a population of persons. 

Thus four correlation matrices accrue in the above example. The 
first is for physical attributes as variables, for a population of per- 
sons; the second is for physical persons as variables,+ for a popula- 


*I purposely begin by calling attention to the standardization, which is, of 
course, automatically effected in the usual product-moment correlation coefficient 


zZuy 





— a, 0, 
+As nearly whole as the 100 measurements allow: a large population of such 
measurements would allow a still closer approximation to the whole physical 
person. 
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tion of attributes, measured relative to each person himself;* the. 
third is also for persons as variables, but the attributes have first been 
reduced so that all are statistically alike; the fourth is for physical 
attributes as variables, for a population of persons, but the attributes 
have all been gauged relative to each individual separately. It will 
not be denied that the matrices usually would contain significant cor- 
relations, and that factors would be discovered upon submitting these 
to factor analysis. It may also be granted that these factors, in the 
above example, with certain exceptions,+ would be reflections of each 
other ; that is, factors in system (1) would have a counterpart to some 
extent in each of the other systems (with the exceptions just men- | 
tioned). 

Nevertheless very different methodological approaches are in- 
volved. 


Two Methodological Approaches. 


System (2) in the above example begins from the standpoint of 
the separate physical person. It neglects each person’s absolute meas- 
urements, his absolute height etc. relative either to a population of 
persons or to an absolute physical standard (the metre or yard). 
Otherwise it can embrace in its population every physical measure- 
ment of a person, relative to himself. 

It begins from the standpoint of the whole body-proportions of 
the person, and it can embrace all possible relations of the physical 
person in this respect. In a sense it can engulf the “total situation”, 
for it takes all the physical measurements into account: but it always 
neglects the absolute features of all these physical measurements. 

I shall show in due course that system (2) can in the same way 
embrace the relational aspects of a person in cognitive and orectic 
spheres no less than in this one of physical measurements. If we ex- 
amine a person’s moods, for instance, every perceivable mood-condi- 
tion can enter into the population to be analyzed by way of system 
(2): but again we would in no way be concerned with the absolute 
quantities of any of the moods. 

System (4) follows logically from (2), beginning from the stand- 
point of the proportions of body-parts relative to each individual him- 
self. Each body-measurement is judged or gauged purely with respect 


*Different units of measurement could be used for each person, so long as 
the same unit is used for all the measurements of the one person. The units 
could be inches for one person, centimetres for another, feet for another etc. 

+Such as, that if only one factor covered the matrix for system (1), no 
broad factor could appear in system (3) in the above example. 
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to the individual of which it is part. In a sense the emphasis is still 
‘on the ‘total’ person, the “total situation”. 

System (1) begins with absolute measurements, in the sense of 
measurement in terms of a yard or metre. But it reduces all such 
measurements to standard terms, and the supposed absoluteness of 
the variates is merely a matter of relativity again — each person’s 
height in standard terms is a measure made in terms of and relative 
to the heights of a population of persons, and similarly for each sepa- 
rate attribute. 

It is true that system (1) could encompass the ‘total’ person in so 
far as all possible attributes (height etc.) are used as variables. But 
it completely neglects the relativity of such attributes, relative to each 
person, and it perpetrates a puzzling reduction of all the separate at- 
tributes to statistical terms of equal significance, irrespective of the 
relative significance of the attribute in the person. One cannot but 
have the suspicion that in the above example the system is dealing with 
severed heads, arms, noses, legs etc., piled into separate heaps, and 
that it will be impossible to find our way back to the whole person 
again. 

I have already mentioned that although system (1) appears to 
begin with absolute variates, it does so only in a sense ‘relative to a 
population of persons’. The system can certainly tell us if, and how, 
the various attributes vary proportionately in a population of persons. 
But it can tell us little or nothing about the whole body-build of any 
individual person. It supplies information of a general kind. There 
would seem to be only one set of conditions under which system (1) 
can give us information about the whole individual as such, namely, 
when a!] the variables are approximations to one and the same aspect 
or feature of individuals. This is the case, for instance, when mental 
tests are variables. The variates for each mental test can be legiti- 
mately reduced to standard terms; that the variables vary propor- 
tionately can be determined ; but the information so supplied is perti- 
nent to the whole individual, as Line* has shown. System (1) can 
supply undistorted information about mental test variates, because 

each represents, approximately, the same whole cognitive person in 
operation; each is an approximate measurement of process, measured 
in units of time. But unless some such common significance is at- 
tached to each variable, or may be so attached to each, system (1) 
cannot do other than distort the original facts, so far as the individual 


*Line, W. loc. cit. 
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is concerned,* as it undoubtedly does in the above physical example, 
and as it does, I shall later show, in work with temperament and 
character traits. 

It is interesting to draw a comparison between systems (1) and 
(3), and (2) and (4), in terms of gestalt and the so-called atomistic 
theories in psychology. Gestalt theory scarcely needs a definition if it 
is identified with that of the Wertheimer-Koffka-Kohler school of 
thought. In particular this school rebels against the viewpoint that a 
whole is a sum of parts; the whole, rather, has primacy over any 
parts.7 

Returning to our example of the physical attributes of persons, 
system (1) begins from the ‘atomistic’ standpoint. It begins with 
parts, and its factors also represent only parts of the individual. Thus, 
as is mentioned later, one factor may be specific to trunk measure- 
ments, and another to head measurements etc. Even the factors al- 
ready known in system (1), such as g, p, 0, w, etc., are also parts in 
the same sense. 

It has hitherto been a criticism of system (1) that, having taken 
the person to pieces and measured his g, p, 0, etc. separately, it could 
not proceed to put the person together again. The same criticism 
would apply, no less, to the physical variables or any factors arising 
from them. Now by applying system (3) to such part measures or fac- 
tors, i.e. using the part measures as variates for a person, and there- 
upon correlating persons, we can certainly put these measures together 
again, with respect to each person. But having done so, system (3) 
by no means builds up the whole person again. Its factors can only 
be distorted, unreal, or potential, with respect to any individual, be- 
cause it assumes the parts to be equally weighted for the individual 
at the outset — all the parts are given equal significance relative to 
the individual, when in fact they have not such equality. Beginning 
with parts in system (1), then, cannot lead to a whole person by way 
of system (3). 

System (2), on the other hand does not begin with such equally 
weighted parts — instead, each is given its proper significance rela- 
tive to the whole of which it is part, and which governs these parts. 
System (2) here seems to correspond with the gestalt standpoint in 
psychology ; system (1) and (3) correspond no less with the atomistic 
standpoint. System (2) encompasses the “total situation” ; system (1) 
does not really do so. System (1) begins by distorting the original 


*So far as the population of persons is concerned it perpetrates no such dis- 
tortion. 


+Peterman, B. The Gestalt Theory. (Kegan Paul, London, 1932). (I refer 
to the “principle of the primacy of the whole over its parts”.) 
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parts relative to the individual; system (2) effects no such distortion. 

Nevertheless, as I have indicated, system (1) can always be used 
to supply information about the way in which any variables, for part 
or ‘whole’ aspects of a person, depend upon one another, i.e. it will 
show whether the variables are related, in the sense of varying pro- 
portionately in a population of populations, and with respect to that 
population. In this way, however, the system supplies information of 
interest to general, and not to individual, psychology, except when 
‘whole’ aspects of persons are involved. As I have indicated, when 
the variables are for ‘whole’ aspects of persons, as is approximate- 
ly the case for cognitive tests as variables, then system (1) not 
only supplies information of value to general, but also of essential in- 
terest to individual psychology. If, with Line, we consider the meas- 
urements supplied by mental tests (cognitive abilities, or functions or 
processes in general) to be for the ‘whole’ person in action, so that 
they have gestalt implications at the outset, then system (1) can sup- 
ply information which in no way is a distortion of the original data— 
except in so far as any experimental abstraction from reality is ar- 
tificial. 

There can be no doubt that any data can be examined by either 
system (1) or (2). But there are circumstances under which it is 
most pertinent and proper to use either the one or the other, for other- 
wise the original data will be distorted. 


Further Observations. 


At risk of being tedious I add some further observations about 
the above example for physical attributes, and about the four stand- 
ardization systems in general. 

Any factor issuing from system (2) can have its nature com- 
pletely estimated.* A factor so described is an estimate of the most 
typical (hypothetical) person for the given relational system of at- 
tributes or measurements. The factor is represented by a population 
of measurements (ih standard terms) to which all persons of the fac- 
tor are approximating in some degree, and the factor saturation of a 
person is an index of how closely he approximates to this ‘typical’ per- 
son.+ The factor still refers to a ‘total’ person, except that it neglects 
the ‘absolute’ nature of the attribute, i.e. the relation of each to a 


*See Stephenson, W. A new application of correlation to averages. Brit. J. 
Educ. Psych. VI. 1936. p. 48. 

+I shall indicate in due course that factors in system (2) need not be ana- 
lyzed orthogonally, and that thereby the “ ‘total’ situation” is retained. 
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population of persons. This omission from the ‘whole’ person* is not 
so grievous a matter in system (2), as is the neglect of the relational 
features of the attributes in system (1). 

Factors in system (1) are an indication that the separate vari- 
ables entering into it vary proportionately in the population of per- 
sons. The person with the longest legs will tend also to have the long- 
est arms, the widest chest, and the biggest feet etc. The essential na- 
ture of a factor in the latter case is represented by a population of 
persons, each person in which has his own constant factor amount 
(g, in standard terms), which tends to be the same in all the variables 
entering into the factor.+ But this factor amount, which is estimated 
from a regression equation,{ cannot be said to give us any descrip- 
tion of the individual person. In the first place it is more than likely 
that any factor in system (1) will only be for some and not for all 
the physical measurements — one factor may be specifically connect- 
ed with the trunk measurements, and another with head measure- 
ments, etc. This indicates at once that the factors are not encompas- 
sing the ‘total’ physical person. Secondly, the factor estimate for a 
person in system (1) only has a meaning relative to all the other per- 
sons involved; the estimate is relative to the population of persons. 


System (3) concerns itself with a population of physical meas- 
urements, the same in name as for system (2). But the actual vari- 
ates are all in standard terms — a nose is on the average no longer 
than a leg in standard terms, and system (3) begins with such meas- 
urements. A factor will no doubt emerge. But its nature can scarcely 
be that of any ‘real’ physical person, because we have begun by miz- 
ing up the original measurements, so to speak, with those of other 
people. We could therefore scarcely hope to return from the general 
(represented by the factor in system (3)) to the particular (the in- 
dividual physical person). 

System (4), on the other hand, does not perpetrate such an ad- 
mixture in its original variates. Its factors again are an indication 
that the physical features entering into a factor vary proportionately, 
just as for system (1). 

System (1), of course, is that used exclusively in the past in fac- 
tor analysis. It supplies the basis for scientific work on empirically 


*The word ‘whole’ must be taken here to mean relational together with ‘ab- 
solute’ aspects of the physical measurements. 

+This is merely a literal description of what a theorem such as the Spear- 
man two-factor (a common g and specific s factors) means. 

tIn the Spearman two-factor theorem the amount is: 


=ry:m + 6745 V1— 1, 
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observed or cbjectively determined individual differences.* Although 
it begins with the premisses of individual differences, it only contem- 
plates measuring any individual for those differences which enter in- 
to a factor. When a factor appears, such as Spearman g, there is some 
guarantee that all persons are alike in respect of certain individual 
differences. In this respect the system is primarily of interest to Gen- 
eral rather than to Individual psychology. 

System (2) does not begin from the standpoint of individual dif- 
ferences of any kind, but with empirical discoveries of a qualitative 
kind. Individual differences are in no way essential to it. But it gives 
rise to a kind of individual differences which has never before been 
attended to by psychometrists, namely, individual differences in type.} 
So far as system (2) is concerned, however, all persons might be of 
a type i.e. approximating to the same “typical” person, and they could 
also be identical in degree of factor saturation for the type. It just 
happens that in fact they are not. That different types emerge, and 
different degrees of each, is a matter for objective determination. 
Preliminary results show, as we might expect, that for many popula- 
tions of traits or other data, all persons are not of a type.ft The dis- 
trivutions of degrees of type for a population of persons is also a mat- 
ter for determination from the observed facts: for eidetic type stud- 
ies, for instance, the distribution of factor saturations may be marked- 
ly skewed. Only the facts can tell. 


Conclusion. 


I cannot flatter myself that I have made everything in the above 
pages unequivocable, or even understandable. The main contribu- 
tions, however, are clear enough. 

1. The matrix of persons (row) and attributes (column) used 
by factorists can be employed in more ways than the one used almost 
exclusively in past factor analysis and psychometry generally. Four 
ways have been described and illustrated. 

2. Two of these four (1, 3) have atomistic foundations unless 
they deal with process measurements. The other two (2, 4) have re- 
lativistic and gestalt implications. 

3. The criterion for use of system (1) would seem to be that 
the variables are equally significant for the individual; when they are 

*The words refer here, and always in the sequel, to differences with respect 
to a population of persons. 


{The word ‘type’ is here and throughout this paper used in a special sense, 
so far as I know never before used. It is the subject of a paper to follow this one. 


tSee examples in my various introductory papers. 
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not, then system (2) is the appropriate system to begin one’s investi- 
gation with, from the standpoint of the individual. 

4. There can be no doubt, I think, that system (2) deals most 
faithfully with the original data in the case of the physical measure- 
ments, from the standpoint of the individual. 

5. System (4) supplies a means for investigating whether and 
how separate* attributes vary proportionately. It would seem to be 
the gestalt counterpart of system (1), but system (1) is more ob- 
viously applicable to studies of this general kind, i.e. the varying pro- 
portionalities of attributes for a population of persons. 

I would conclude by reiterating that there is no question in the 
above of suggesting that systems (2) and (4) should replace systems 
(1) and (3) in psychometry. It is obvious that all four can be used, 
each in its appropriate place. But system (2) can deal with ‘total’ 
persons, whereas system (1) only does so for process measurement. 


* * * * * * * * * 


Note added July 10th, 1936. 


In the above paper I have been concerned with certain particular and gen- 
eral applications of the four factor systems. Only a brief definition of each sys- 
tem was given, a full account being reserved for the main work upen which I 
have been engaged for some time, which is to be published shortly. The above 
account seemed to be sufficient for the present purpose, which was to draw atten- 
tion to other possibilities in factor analysis, and to other applications, than those 
widely known to most workers. But the brevity of the definitions, and certain 
special features of the exampie by which I have illustrated them, may lead to 
misunderstandings which the fuller treatment will obviate. Meanwhile, the fol- 
lowing further notes may forestall misinterpretations, at least. 

The important systems are (1) and (2), the other two being of minor imme- 
diate interest. I have clearly stated that systems (1) and (2) can be quite inde- 
pendent, but it will be better to state explicitly that I regard the two as, by very 
definition, statistically independent of one another in general. It will be helpful, 
in this connection, to distinguish between two different forms that system (2), 
or (1), can take. 

System (2) in general is not to be regarded as the direct obverse or mere 
transpose of data already analyzable by way of system (1); nor is it the case 
that these two systems are merely two complementary ways of analyzing one 
and the same original matrix of data, the results being no less complementary or 
deducable one from the other. Such complementary conditions, and the possibility 
of direct analysis either in rows or in columns, is only possible under the special 
ease of universality of unit. One must first define what units are involved in 
either the rows or the columns. It is not sufficient to put scores into a matrix and 
then say without more ado that they can be analyzed either by rows or by col- 
umns. Of course they could be; but unless something is told us about the units 
involved, such two-way analysis will usually be a futile and useless proceeding. 
It is precisely the purpose of systems (3) and (4) to ensure, under the conditions 
of non-universality of unit in either rows or columns respectively, a general re- 
duction to comparable units, so that analysis would then be sensible and possibly 
useful in general. In the very special case, however, when one and the same unit 


*The word again refers only to the mode of thinking about the attributes. 
Systems (1) and (4) proceed to show whether the attributes vary proportionate- 
ly, and when any two so vary neither can be regarded as isolated or distinct. 
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of measurement is involved in both columns and rows by definition (as could be 
the case in the example of the physical attributes described above, if an inch is 
used throughout as the unit of measurement, for all persons and all attributes), 
direct analysis by either rows or columns is sensible and permissible. In this case 
the same original data, without any reduction of any kind, can be analyzed either 
by system (1) or by system (2). 

In the general case, however, the data is by very definition distinct for analy- 
sis by way of system (1) or (2) respectively. In system (1) it is not in the least 
essential to have one and the same unit for all attributes or tests; it is merely 
essential that the unit for any one attribute should be one and the same for the 
whole population of persons. In system (2), likewise, it is not essential to have 
one and the same unit for all persons; it is merely essential that the unit for any 
one person should be the same for the whole population of attributes involved for 
that person. The units for system (1) moreover, are generally by definition or 
fact different from any of those involved in system (2), so far as we know, or 
need know. 

It so happens that I consider this non-universality of unit, in either rows or 
columns respectively, to be a matter of the very first importance. In the study of 
personality, for instance, matters of the greatest possible interest depend upon 
this fact, that analysis can proceed without a universal unit of measurement. 

The burden of the above paper, however, amongst others, was that of show- 
ing that even when we begin with special data of the kind used in the example, 
for the 100 physical attributes, and analyze it by both system (1) and system 
(2), the two different standardizations effected (when one correlates the rows, or 
the columns), effect a radical alteration in the original data, and therefore make 
the two subsequent factor analyses really incomparable. The two are only depen- 
dent in so far as both use the same crude data; but the different standardizations 
effect profound changes in this crude data. As I have already said, analysis by 
way of system (2) could proceed even if different units are used for each person 
respectively; but in that case direct obverse analysis of that same data by way of 
system (1) would be a futile and meaningless proceeding. The same is true, of 
course, if the original data is for unique or different units per attribute (but the 
same for all persons, per attribute) ; in such a case subsequent analysis of the 
same data as it stands, by way of system (2), would be futile and meaningless. 
But even if one and the same unit is used throughout, for all persons and all 
attributes, the respective standardizations to which the data is submitted (in ef- 
fect) when either rows or columns are correlated, radically change the original 
data. In effect, therefore, one is not dealing with the same data in the two sys- 
tems, for factor analysis only concerns standardized material, and not crude data. 
It would have been better to have stated this explicitly or more emphatically. 

In the above paper I wrote (largely so as to allow me to press on with the 
essential argument) that the various factors arising in the different systems 
might, to some extent, be “reflections” of one another. Had space permitted, or 
intention inclined me to it, I should have elaborated this considerably. Ac- 
tually I go on to show that the factors will in fact be very different in the various 
systems, both as to essential nature and as to interpretation, and this is gen- 
erally the case. They are really only superficial “reflections”, as matters of in- 
terpretation. 

I had intended expanding on matters of the above kind in my fuller study 
of system (2), for there are still other interesting considerations involved. In 
general however, in system (2) I am concerned to adumbrate a system which 
will be the very poles apart from system (1). System (1) involves a population 
of persons, but there need only be one attribute for investigation; system (2) 
involves a population of attributes, but there might only be a single person in- 
volved. (I shall supply examples in due course of factor analysis, performed 
upon myself alone as variable.) I therefore demanded quantification which could 
be confined to a single person, uniquely if need be, and system (2) supplies a 
means for dealing with such quantifications. Similarly, in system (1) there is 
an unlimited population of persons from which to draw one’s samples, and sam- 
pling error theory is pertinent. But in system (2) there may be a restricted 
population of attributes, so few that one’s ‘sample’ constitutes the whole universe 
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= these attributes; I have therefore to contemplate other than sampling error 
theory. 

The statistical differences between systems (1) and (2), then, are radical 
and fundamental. Usually, by very definition we begin with data the units for 
which are expressly defined with respect to system (1), or with respect to system 
(2), and there is no necessary connection at all between them. The psychological 
applications of these systems are no less profoundly different. 

In the above paper I have tried to indicate very briefly some of these psy- 
chological applications. System (1) has its counterpart in the ‘atomistic’ stand- 
point in psychology, whereas system (2) encompasses a gestalt standpoint. I 
shall, of course, have much more to say about this in future work, but sufficient 
has been said, I think, to point towards this important conclusion. I may be par- 
doned, however, if I elaborate a little on two matters touched upon very briefly in 
the above paper. 

Factor analysis by way of system (1), as I shall show in detail elsewhere, 
is concerned primarily with tendencies, i.e. with the way in which attributes vary 
one with another in a population of persons. It begins from the standpoint of 
observed individual differences (relative to a population of persons), and arrives 
at information about abilities, processes, unitary traits, primary abilities, or in 
general, fundamental tendencies. Each such tendency or process is an explanation 
for the observed facts, namely, that such and such variables vary proportionately 
and uniquely. The factors are by definition universal. And it seems to be a fact 
that there is only a limited number of such fundamental tendencies in the hu- 
man being: Spearman found only five or so; Thurstone specifies seven; the Thorn- 
dike Unitary Traits Committee hoped to find from 1 to 20. The implication is 
that these few fundamental tendencies account for, or explain, or are the cause 
of, all human conduct. Right through history there has been a search for such 
tendencies, or else a talk about them, from the classical humours of Hippocrates 
or Galen, to the classical Introversion-Extraversion of Jung, and now the same 
kind of thought is involved in the search for unitary traits. System (1) is the 
method for investigating such fundamental tendencies. I do not as yet deny its 
efficacy in this respect, but I draw attention to the fact that the system is lim- 
ited to such investigations. 

System (2) accomplishes something very different indeed. It can supply no 
direct information about fundamental tendencies of the kind just referred to. 
Instead it deals with whole personalities, so-called ‘irrelevancies’, in general with 
‘total situations’ in what I take to be the gestalt meaning of these words. It 
deals with attributes relative to each individual, and to himself alone, and each 
attribute has its full and proper context (in so far as this is reasonably possible 
in any analysis) in the whole of which it is part. This is excellently illustrated, 
I think, by application of system (2) to the 100 physical measurements described 
above. Like all analysis, system (2) has to parcel out its applications. It can 
only deal with whole aspects of persons, with the physical whole, the mood-con- 
dition whole, the cognitive whole, and so forth. All measurement is relative to 
the individual, and to himself alone. It has no necessary concern at all with in- 
dividual differences of the system (1) a priori or a posteriori kind — instead 
it gives rise to its own kind of individual differences, those of type and type sat- 
uration or loading. Its factors are not universal necessarily; they may be, but 
only as a matter of a posteriori objective determinations. 

In the physical example I postulated 100 attributes. To cover the physical 
whole more completely, or more determinately, this population might be increased 
to 1900, or to 10,000 if need be. But even 20 of these attributes might define the 
physical whole of all persons sufficiently approximately. It would have to be 
shown that whatever population is selected from amongst all the possible physical 
attributes of a person, the factor systems only altered in determinacy, and not 
in essential nature. In such a case we could legitimately say that we are dealing 
with a real physical whole, a ‘total situation’. It seems too obvious to mention 
that a whole situation can never be encompassed if by this we mean, as Thur- 
stone apparently means, the infinite universe of all possible attributes and all 
possible persons. In system (2), as in system (1), we can only deal with a finite 
population of either persons or attributes. Even so, system (1) is fundamentally 
‘atomistic’ (fortunately, for it is its special merit that it allows an approach to 
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the study of fundamental tendencies), whereas system (2) deals with ‘total situ- 
ations’ in the above gestalt sense. In system (1) each attribute is stripped from 
its total context, each is standardized in terms of a population of persons (which 
destroys all possibility of returning to the individual wholes of the person). The 
‘total situation’ can never be realized in system (1), even if the whole universe 
of attributes are used as variables. It may be retained intact in system (2), with 
more or less determinacy, for as few as ten, or even less, attributes. 

I have said that there are possibly only a few fundamental tendencies in the 
human being, and therefore only a few unitary traits are born out of system 
(1) analysis. But there are possibly millions of types, each a common factor, 
(common that is to several or many persons, but not necessarily to all), in system 
(2) analysis. Type psychology comes into its own by way of system (2). Almost 
everything that we factorists have denied about type psychology has been beside 
the point: types exist in great number, and one person can be of one type and 
not at all of another; types are not extremes of otherwise normal distributions, 
if bd type we mean the interpretation given to a common factor in system (2) 
analysis. 
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THE APPROXIMATION OF ONE MATRIX BY 
ANOTHER OF LOWER RANK 
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The mathematical problem of approximating one matrix by an- 
other of lower rank is closely related to the fundamental postulate 
of factor-theory. When formulated as a least-squares problem, the 
normal equations cannot be immediately written down, since the ele- 
ments of the approximate matrix are not independent of one another. 
The solution of the problem is simplified by first expressing the mat- 
rices in a canonic form. It is found that the problem always has a 
solution which is usually unique. Several conclusions can be drawn 


from the form of this solution. 
A hypothetical interpretation of the canonic components of a 


score matrix is discussed. 


Introduction 


If N individuals are each subjected to v tests, it is a fundamental 
postulate of factor theory that the resulting » * N score matrix a 
can be adequately approximated by another matrix # whose rank r is 
less than the smaller of n or N. Closely associated to this postulate 
is the purely mathematical problem of finding that matrix 6 of rank 
r which most closely approximates a given matrix a of higher rank R. 
It will be shown that if the least-squares criterion of approximation 
be adopted, this problem has a general solution which is relatively 
simple in a theoretical sense, though the amount of numerical work 
involved in applications may be prohibitive. Certain conclusions can 
be drawn from the theoretical solution which may be of importance 
in practical work. 

To formulate the problem precisely, it is convenient to define the 
“scalar product” of two x < N matrices as the following numerical 


function of their elements: 
n N 
(a, 8) = = ais Bij (1) 


where aj; is the ij-element of the matrix a. This function has all the 
properties of the scalar product of vectors: 
(a, 6) = (f,4) ; (2) 
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(a =: f, y) = (a, v) sm (8, y) ; (3) 
(va, f) =-2(a,f) if x isa number; (4) 
(a,a)>0 ifas0. (5) 


The positive number /, defined by /? = (a, a) may be called the length 
of the matrix a; the number /? was called the span (Spannung) of a 
by Frobenius.* The length of the matrix a — f may be called the dis- 
tance from a to #. 

The problem can now be formulated in a definite manner: to find 
that matrix § of rank 7, such that there is no other matrix of rank r 
whose distance from a is less than the distance from f to a. This 
amounts to requiring a least-squares solution of the approximation 
problem, every element of the given matrix being given equal weight. 


Some preliminary theorems and remarks. 


To simplify the following discussion, let a, b, u, --- denote n « n 
matrices, A, B, U, --- denote N * N matrices, and a, §, --- denote 
n X N matrices; the special case n = N is not excluded. The N & n 
matrix which is obtained by writing the columns of a as rows is the 
transpose of a and will be denoted by a’. The products aa, aA, aB’, 
a’a, --- are defined in the usual way. It is then seen that the following 
equations are correct: 


(a, 8) = (a’, 6’) ; (6) 
(aa, B) = (a, a’p) = (a, fa’) ; (7) 
(aA, 8) = (a, BA’) = (A, a’B) . (8) 


From Eq. (7) and (8) it follows that if « and U are orthogonal mat- 
trices (u wu’ = wv’ u = 1,, U U’ = U’ U = 1y), then 


(wa U’,u BU’) = (a, 8B). (9) 
Another useful proposition is the following: if (a,b) =— 0 for all 
symmetric (skew-symmetric) matrices, b, then a is skew-symmetric 


(symmetric). 

The solution of the problem is much simplified by an appeal to 
two theorems which are generalizations of well-known theorems on 
square matrices. They will not be proven here. 


*Quoted from MacDuffee, “Theory of Matrices”, Ergebn. d. Mathem., v. 
2, No. 5 p. 80 (1933). 

+Courant and Hilbert, “Methoden der mathematischen Physik’ Berlin, 1924; 
pp. 9 et seq., p. 25. MacDuffee, p. 78. 
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Theorem I. For any real matriz a, two orthogonal matrices u and 
U can be found so that 2 = uaU’ is a real diagonal matrix with no 
neyative elements. 

A diagonal matrix 4 (square or rectangular) is one for which 
Ai; = 0 unless i = 7. If a diagonal matrix is rectangular, then there 
will be some rows or columns which consist entirely of zeros. For the 
following, this remark is of some importance, as will be seen. The 
equation of the theorem may also be written 


a=wiu (10) 


whose right side may be called the canonic resolution of a. If n < N, 
4 will have N—n columns of zeros and a is seen to depend only on the 
first n rows of U. If u, 4 and the first rows of U are given, a is 
determined. 

Let v be the diagonal n  n matrix which consists of the first n 
columns of 4, and w the n & N matrix composed of the first n rows 
of U; then these remarks can be summarized by the equation 


a=wve (10.1) 


where w w’ = 1,, but w’w ~ 1y. For numerical work, Eq. (10.1) is 
preferable to Eq. (10); for formal manipulation, Eq. (10) is more 
convenient. : 

The numerical evaluation of u and v (or 4) can be accomplished 
from the consideration of the matrix a = aa’ alone. This matrix is 
closely related to the matrix of correlation coefficients of the tests. It 
is seen that 


exmad=widsvuxwvu, 


and since 4 7’ = v? is a diagonal matrix, it follows that u is one of the 
orthogonal matrices which transform the correlational matrix to di- 
agonal form. The rows of uw are unit vectors along the principal axes 
of a and the squares of the diagonal elements of v (or 4) are the 
characteristic values of a; this shows that the latter can never be 
negative numbers, a result which can also be obtained more directly.* 
The methods for determining the principal axes and characteristic 
values of a symmetric matrix are also known,7 so that these remarks 
may be considered as indicating the method for calculating wu and 2. 
If none of the characteristic values of a is zero (this will presumably 
be the case in the overwhelming proportion of actual calculations) 


*Courant-Hilbert, p. 20. 
+Courant-Hilbert, pp. 13, 16. 
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the matrix v will have a reciprocal, and w can be obtained by solving 
Eq. (10.1): 


Ov" wWa. 


The numerical values of the elements in the remaining rows of the 
matrix U will not be needed, but could be found if necessary. For 
simplicity of manipulation, it is convenient to proceed as though this 
has been done. 

The diagonal elements of 2 were called the “canonical multipliers” 
of a by Sylvester.* The multipliers and characteristic values of a cor- 
relational matrix are identical; in the case of a symmetric matrix, 
there may be a difference in sign; for a general square matrix, there 
is no simple relation between the two; for a rectangular matrix, the 
characteristic values are not defined. 

The correlationai matrices, Sylvester called the “false squares” 
of a. In the foregoing (and in the usual treatment of factor theory) 
only the matrix a = aa’ has been considered. However, the matrix 
A = a’a is related to the correlation coefficients of the individuals in 
the same manner as a is related to the correlation coefficients of the 
tests. There is complete mathematical symmetry between the two 
correlation matrices. 

To every multiplier, there is associated a row of uw and a row of 
U; this complex of n + N + 1 numbers may be called a canonic com- 
ponent of a. 


Theorem II. If af’ and f’ a are both symmetric matrices, then 
and only then can two orthogonal matrices u and U be found such 
that 4= ua U’ and u=ufpU’ are both real diagonal matrices. 


Either one (but in general, not both) of the diagonal matrices 
may be further restricted to have no negative elements. This theorem 
is a generalization of the theorem that the principal axes of two sym- 
metric matrices coincide if and only if ab = ba. 


Solution of the problem 
The distance of 6 from a is given by x, where 
x? = (a,a) —2(a, 8) + (8,8) ; (11) 


x is a function of all the elements of f, and these are to be determined 
so that its value is a minimum. The elements of f are not all indepen- 
dent, however, because of the requirement that its rank be less 
than the number of its rows or columns. Theorem I makes it possible 


*Messeng. Math., 19 p. 45 (1889). 
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to eliminate some of the interdependence: suppose { to have been re- 
solved into canonic form: 


pow uU (12) 


with uw diagonal, and u and U orthogonal matrices. Then the rank of 
6 will be r if and only if uw has this rank i.e., if just 7 of the diagonal 
elements of uw are different from zero; the non-vanishing elements of 
u will be independent. However, the elements of u or U will not be in- 
dependent, since these matrices must be orthogonal. It is not neces- 
sary to express these matrices in terms of independent parameters 
because of the following proposition:* if uw is any orthogonal matrix 
and the independent variables that determine it are given any infini- 
tesimal increments, the resulting increment of ~ is 


bu=Us, (13) 


where s is a skew-symmetric matrix whose elements are infinitesimal, 
but otherwise arbitrary. 
The Eq. (11) becomes, because of Eq. (12) and (9), 


x? = (a,a) —2(a,wuwU)+ (u,u) . (14) 


Since x is to be a minimum, it follows that 6 x? = 0 when wu is given 
the increment 6 u (Eq. (18)). 
Hence 


0= (a,—sw wU) =— (a,8 8) =— (af’,8) ° (15) 


Since s is an arbitrary skew-symmetric matrix, it follows that a ~’ 
must be symmetric. Discussing the increment of U in the same man- 
ner, it will be found that /’ a must also be symmetric, and hence, by 
Theorem II, the orthogonal matrices can be found so that Eq. (12) 
and 


a=wiU (12.1) 


(with 4 the diagonal matrix of the multipliers of a) are both valid. 
Then Eq. (11) becomes 


2? = (A—u, A—n) 
= Di (Ai — wi)? (11.1) 


4; and u; being the diagonal elements of the corresponding matrices. 
It remains to determine the matrix u so that this expression has 
its minimum value, subject to the condition that just 7 of the y; shall 


*Courant-Hilbert, p. 27. 
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be different from zero. It may be supposed that 4, = 4.24; 2-::; 
then an obvious solution of the problem is 


mend, , tr 3 
=—-0, i>r. (17) 


This is also the only solution unless 4, = Apis. 

The procedure of finding 6 may be summarized: (1.) express a 
in the canonic form of Eq. (12.1); (2.) replace all but r of the di- 
agonal elements of 4 by zeros, beginning with the smallest and con- 
tinuing in order of increasing magnitude. (3.) The resulting matrix 
is uw, and # is given by Eq. (12). The solution is unique unless 
ri ss ey . 

The minimum value of x? is 


R 
ze? = > Ai? 
i=r+1 


(R being the rank of a and the multipliers being numbered as before). 
This leads readily to the following conclusion: if 1 is the length of a, 
the smallest upper bound for x is 1(1—r/R)*”. This can be utilized 
in estimating the significance of an approximation to a obtained by 
some other method than the present one. 

Another important conclusion is that if 6 is the best approxima- 
tion (of rank 7) to a, then b= § (’ (B = f’ 8) will also be the best 
approximation (of rank 7) toa—=aad’ (A =a’‘a). Thus, if the best 
approximation to the score matrix is found, then the correlational 
matrices calculated from it will automatically be the best approxima- 
tions to the correlational matrices calculated from the original score 
matrix. This is not surprising, but requires proof; the proof is readi- 
ly supplied from the foregoing results. 


Concerning a hypothetical interpretation 
of the canonic components of B 


It is reasonable to inquire if, when a is a score matrix, those of 
its canonic components that enter into 6 may be interpreted as the 
independent factors that determine the differences in the performance 
of various individuals. In order to discuss this question let p = 1, 
--- m number the tests, 7 = 1, --- N number the individuals, and o or 
o = 1,--- y number the components of f that have non-vanishing mul- 
tipliers. Then the component o consists of the n +- N + 1 numbers 


Upp » A>, Upi 
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which satisfy the equations 
Dp Upp top= Spe 3 Ds Ups Vor = Spe | (18) 
Bri = Dp Upp Ap Upi « (19) 


These equations suggest that if these numbers have any empiri- 
cal significance, then N’/? Up; will be the standard score of individual 
i for the ability to exercise the factor 0, and n”? up, the standard score 
of the test p for its demand for the exercise of the factor. The ap- 
pearance of the multipliers in Eq. (19) does not correspond to the 
accepted postulates of factor theory, and their interpretation is thus 
not immediate. 

To clarify this point, one may inquire after those empirical cir- 
cumstances, which if they are actually realized, will lead one to ac- 
cept the foregoing interpretation. One such set of circumstances 
would be the following: suppose that two sets of individuals, N; and 
Ny in number, have been tested by the same battery of n tests. The 
scores of the N,; individuals can then be arranged in an n  N; ma- 
trix a;, those of the Ny individuals into an n * Ny matrixar; or, the 
scores of all N; + Ny individuals can be arranged into an n < 
(N; + Nu) matrix a obtained by adjoining a; and ay. First consider 
a and ay; if these can be approximated by matrices f; and fn having 
the same u-matrices* (within reasonable limits of error), then the 
foregoing interpretation would be appropriate. In this case the 
u-matrix obtained by approximating a would also be the same as the 
others. The multipliers Jp1, 4pm and jp, and of course also the 
Upiz, Upisn and Up; obtained from the three score-matrices would not 
be identical. In order to discuss their interrelations, let the index i 
now run from 1 to N; + Nn, the first N; values designating individ- 
uals of the first set, etc. Then it can be shown that 


Ap? = dp? + dpm (20) 
and 
dp Upi = Ap,x Upiz, ifi SN, ; 


=Apu Upin, if > Nr. (21) 


These are precisely the relations that would be expected accord- 
ing to the suggested interpretation if A4p1;°/N: and Jpn? /Nu were the 
variances of the two sets of individuals. This is not a tenable inter- 
pretation, however, because of the symmetric manner in which the 


*The sufficient condition for this equality is that a, a,,; be a symmetric matrix, 
where a, = 4,a,’, etc. This criterion can be applied even when w, and u,, are not 
known. 
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individuals and the tests enter into the calculations. It may, how- 
ever, be supposed that j,:°/nN; is the product of the variance of the 
set of individuals with the variance of the set of tests. It should be 
noted that it has been implicitly assumed that the scores are not meas- 
ured from the averages of the different sets of individuals, but from 
a fiducial zero which is the same for all sets. 
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A FURTHER SIMPLIFICATION OF THE 
MULTIPLE AND PARTIAL CORRELATION PROCESS 


HAROLD D. GRIFFIN 


Nebraska State Teachers College 
Wayne, Nebraska 


Formulas are derived for simplified computation of partial and 
multiple correlation coefficients, and generalized to n variables. Time 
required for computation is compared with other methods. 


1. Introduction 


Psychometricians and others who work with the multiple and 
partial correlation techniques and with regression equations will find 
that the methods presented in this article will save an appreciable 
amount of time and labor. The greater the number of variables, the 
more apparent will be the saving effected by these new formulas. 

Several years ago the writer developed and published fundamen- 
tal formulas for the Doolittle method in terms of zero-order correla- 
tion,* but his failure to discern relationships within the formulas to 
known statistical concepts other than 7’s and f’s prevented him from 
expanding the formulas beyond four variables at that time. 


The formulas were: 


Three Variables 


Bor.1== (102 — 101 M12) / (1 — P22?) eke ye 2 ee Te Se 
Bor2= To — Boe. cy a a a ee ee ee « « JA 


ee a a a a se ] 
Four Variables | 


To2 — To Niz 
$$ (123 — 12 N13) 
1 —— mee 


Bos.r2 — . % : ¥ [3a] 


Y23 — Tie Ti3 
1 — 7,3 — ——— = (823 — Pz 113) 


To3 — To1 N13 — 





To2 — To1 Nr2 23 — Tre Y13 


A, a.) pd Ad 3b 
i—_r, Bos.12 1—r. [ J 


Boz.13= 


*Griffin, Harold D. “Fundamental Formulas for the Doolittle Method Using 
Zero-Order Correlation Coefficients.” Annals of Mathematical Statistics I 
(May, 1981), pp. 150-153. 
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Bor.2s = 11 — (Boz.13 Tio + Bos.12 13) . ° ; ‘ . . . [3c] 
Ross = (Bor.23 To1 + Boo.13 Toe + Bos.r2 To3)* . ‘ . ; ; > [ 4] 


The writer was encouraged to seek concepts to express various 
recurring parts of these formulas by Professor Walter J. Saupe of 
the University of Missouri and Professor George W. Snedecor of 
Iowa State College. As a result, formulas for problems in three to 
eight variables were published in 1933* in which two new concepts 
were introduced—a (Greek, alpha) for the numerator and w (Greek, 
omega) for the denominator of the partial regression coefficient, f 
(Greek, beta). 

The present article contains a still further simplification of the 
formulas, with elimination of the concept w by substitution of the 
more familiar concept k?, the coefficient of non-determination.+ A 
further extension enables the new formulas to be used not only for 
rapid and precise calculation of the multiple correlation constants but 
for partial correlation problems as well: 

The concepts and their symbols which we shall employ in these 
new formulas are as follows: 


N = the number of cases or observations 

nm = the number of variables 

y= (Greek, nu) the number of independent variables = n — 1 
M = the mean, arithmetic mean 

o = standard deviation 

r = correlation coefficient 

a = incomplete regression coefficient, alpha coefficient 
k? — non-determination coefficient 

k = alienation coefficient 

6 = partial regression coefficient, beta coefficient. 

R?2 = multiple determination coefficient 

R = multiple correlation coefficient 

b = regression coefficient 


*Griffin, Harold D. “Simplified Schemas for Multiple Linear Correlation.” 
Journal of Experimental Education I (March, 1933), pp. 239-254. 

+The following changes will adjust the schemas in the article mentioned in 
the preceding footnote to the new notation: 


Wy, = hp, O94 5 = hg 05 95.34 = 5,034 26.245 — *%o.2865 

Woy = Ky, Wy5 3 = Ks 93 M06.94 = h%6.034 27.345 — ¥?7.0545 

a5 = hy, Wo6.g = 6.03 27.94 = 7.034 28.345 — **s.0345 

Wag = yg &g7.3 = K*, 93 Wag. 94 = Hs. 034 

Wy, = 1 ,, Wog. =k, 95 57.3456 —= "7.03456 


a,, =F, 08.3456 — ‘8.23456 
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C = constant in the regression equation calculated from the 


means 
X = araw value or score 


X = predicted raw value or score 
S = standard error of estimate 
og = Standard deviation of a beta coefficient 
To, = zero-order correlation coefficient 
11.2 = partial correlation coefficient 
2. Three-Variable Problems 


We present below the set-up for a three-variable problem in 
which the object is to obtain the minimum essentials for prediction— 
the beta coefficients, the regression equation, the multiple correlation 
coefficient, and the standard error of estimate. 

Required Zero-Order Data 
Mo, Mi, Me. a0, 01, 02 © Tory To2s Tr2 « 


First-Order Calculations 


Qo2.1 = Noe — 101 112 *g = 1 — *®20 
ke. = 1 — T* . 
Final Beta Coefficients 


Bo2.1 = Qo2.1/k* 12 

Bor.2 = 101 — Boor Tie 

Check: 1o2 = Bo1.2 M12 + Boos - 
Multiple Correlation Coefficient 

Ko 12 = Keo, — Boo.s Goan 

Ross = (1— k*o.12)* 

Alternative: Ro12= (Bo1.2 Tor + Boor To2)* . 
Regression Equation 

Bor.2 = Bor.2 0/1 

Bo2.1 = Bo2.1 0/82 

Co.12 = My — (My Dor. + Me B02.1) 

Xo = bore X1 + Voor X2 + Core « 
Standard Error of Estimate 


So.12 == do Kose ° 
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In a three-variable problem each of the final béta coefficients, 
Boi.2 and foo, will have the same standard deviation. The three-vari- 
able formula for the standard deviation of a beta coefficient is 


801.2 and Bo02.1 = (Ko12/Kk*12N)? . 


When the partial correlation coefficients are desired in a three- 
variable problem, the following additional steps are necessary: 
Additional First-Order Calculation 


koe = 1— Po . 
Partial Correlation Coefficients 
ore = (1 — K*o.12/k 702)? 
Too. = (1 —Ko.12/k%01)? . 


The partial correlation coefficient always takes the sign of its cor- 
responding beta coefficient. Therefore, if 8:2. is positive, 71.2 will 
also be positive. 


8. Four-Variable Problems 


The four-variable set-up for prediction is as follows: 
Required Zero-Order Data 


M,, M3, M., M; . Oo» Tips T2303 « 
Tors To2» To3 Ti29 Nis) Yes « 


First Order Calculations 


Ao2.1 = No2 — Yor Viz k?o. =1— 2, 

Q03.1 = To3 — Tor N13 k?2 = 1— 712 

023.1 = 123 — N12 N13 k?,,; = 1—7?",; 
Boos = Qo2.1/k* 12 


Bso.1 = Gto3.1/K* x2 . 
Second-Order Calculations 
03.12 —= Ao3.1 —— Boo. 23.1 
ko.12 == ko, — Boo.a Qo2.1 


ks 12 = k*13 — Bso.1 23.1 
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Final Beta Coefficients 
Bos.r2 = Gos.12/K*s,1 
Bo2.13 = Bor.1 —Bos.12 Bs2.1 
Bor.23 = V1 — (Bor.13 T12 + Bos.12 T13) 
Check: 1o2 = Bor.23 712 + Boars + Bos.r2 123 - 
Multiple Correlation Coefficient 
k?o.123 = Ke? 0.12 — Bos.12 Gos.12 
Ro.a23 = (1 — ko.123)? « 
Regression Equation 
Bo1.23 = Bor.23 60/01 
Bov.13 = Bor.13 50/02 
Bos.12 = Bos.12 Fo/o3 
Co.123 = My — (My Bor.23 + Me Doors + Ms Bo3.12) 
Xo Bores Xa + Bears Xo + Boars Xs + Co.rss - 
Standard Error of Estimate 


So.123 — Oo Keo.r28 . 


When the standard deviations of the beta coefficients are wanted 
in a four-variable problem, it will be necessary to add the following 
steps to the foregoing procedure: 

Additional First-Order Calculations 


Q13.2 = 113 — Nie 123 k?,,== 1—1?*,; 
B13.2 = O13.2/k* 25 
Bos.1 = Qe3.1/k715 . 
Additional Second-Order Calculations 
Ke, 93 = ke? 12 — Bis.2 Qis.2 
Ke?o.13 = Ke*12 — Bos.1 Gos1 « 
Standard Deviations of the Beta Coefficients 
Op, 5, = (Hosas/'x.a0N)} 
© eng = (Ke 0.123/K2.13N )} 
Op... = (Ke103/Ksa2N)? « 
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When the partial correlation coefficients are required in a four- 
variable problem, it is necessary to add the following formulas to the 
basic set-up: 


Additional First-Order Calculations 
Gos.2 = Tos — Toz T23 Koo = 1 — 7° 2 
k?.3 = 1 — 123 
Bos. = Oos.1/k*1s 
Bos.2 = Gos.2/k23 . 
Additional Second-Order Calculations 
K*o.13 = Ko, — Bos.r Gos.1 
K7o.03 = Kon — Bos.2 Gos.2 
Partial Correlation Coefficients 
Tor.23 = (1 — ko .103/K?o.23) ' 
Yo2.13 = (1 — F*o.123/K*o.13)* 
Yos.r2 = (1 — F°o.123/K*o12)* « 


The partial correlation coefficient always takes the sign of its cor- 
responding beta coefficient. Thus, if 8o:.; is negative, 71.23; will also 
be negative. 


4. Generalized Formulas 


The concepts and formulas which we have employed in this ar- 
ticle are generalized below. The subscripts 1, 2, 3, etc., refer to the 
sequence of the subscripts in the formulas, and 7 refers to the num- 
ber of variables. Other symbols will be explained as they occur in 
the formulas. 


(First order incomplete regression coefficient, alpha coefficient 
—three variables. aj2.3 = 21.3) 


012.3 = N32 — 113 N23 . ° . . . ° . . . . . [5a] 
({n—2]* order alpha coefficient—n>3 variables. 


Gj2.34 + + + mn = Oe1.34+-- n) 
12.34 « + » m — G12.34 «+ » (n-1) —— Bin.sa e+ (n-1) One.34 ++ + (n-1) . [5b] 
(First order non-determination coefficient—three variables) 


Ks. = ]1]— 110 [6a] 
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({n—1] order non-determination coefficient—n>38 variables) 
ks 23 -- +n 103... (n-1) — Bin.23 ++ (m-1) Ain.23 + + + (m-1) ° [6b] 
({n—2]* order intermediate partial regression coefficient, beta 
coefficient—n variables. This is the formula used in ob- 


taining intermediate beta coefficients; the formulas used 
for the final beta coefficients are given under Formula 8. 


Bi2.34 coon Bor.s4 seen) 
Biz.s4 + + + n= O12.34- ++ n/Wese- eon . . . ios . ° -[T7] 


(Final beta coefficients—n variables. In these generalized for- 
mulas for the final beta coefficients, the subscript 0 indi- 
cates the dependent variable or criterion and the subscript 
y [Greek nu] indicates the number of independent vari- 
ables. Bov.se ee (v-1) + Bvo.12 cee (v-1)) 


Bov.re eee (v-1) == GAov.i2-6- (v-1)/K v.12 ee (¥-1) : : . ° ° [8a] 


Bowv-1).22 eeoev Bo v-1).12 Segre 


Bov.12 eee (V-1) Bvv-ry.12 ee (v-2) . e . e e e ° [8b] 


Bovv-2).12 id Bov-2).12 eee (v-3) —— 
(Bocv-1).12 sa i Byv-1) (v-2).12 eee (¥-3) 1 
Bov.12 2° (V-1) Bv(v-2y.12 cee (v-3) ) ° ‘ ° i ° ° [8] 


Bos.r2 coe y = Bos.12 oe (Bos.r2 ooov Ba3.12 + 

Bos.r2 +++ v Bss.r2 tes Bovaz «+» (v1) Bvs.r2) . : . [8x] 
Boz.13 ee as Boz.1 7 (Bos.12 eeev Bs2.1 + 

Bosra eeey Baza + +++ + Bove «© (v1) Bro.) _-« »« » oo 
Bor.23 + +» v==To1 — (Boos. ++ v 112+ 

Bos.12 wee v Ny -- tee + Bov.re e © 6 (v1) iw) r J ; ; ‘ [8z] 

({n—1]* order multiple correlation coefficients—n variables) 

ee) es |) 


({[n—2] order regression coefficient—n variables. 
Das. ci eoeoev =a Draws eee v) 


Dine se es Bor.23 eoov 00/01 r e = . . . . - [10a] 
oo coo vy Bos.as eeerv 00/2 ° . ° . . . . . [10b] 
ete. 


(Constant in regression equation for raw scores—n variables) 
C= M,— (01.23 eee yM,+ Doo.13 eee vM.-+--- 
sig — Devas eee (V-1) M,) . . . . . . . . . [11] 
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(Regression equation in terms of raw scores—n variables) 


Xo = Doses... v Xi Boris...» Xe 

ee ee a er ae 
(Standard error of estimate—n variables) 

Pict oes + + a) 6 & Yr wy Sore oe) 


(Standard deviation of the beta coefficients—n variables) 
%. = (i .22 eee v/ ke? 1.23 eeev N)? e ° . . . [14a] 


01.23 ->++v 


a (k*, Wess v/Ke*e13 eee N)? . . ° . . [14b] 
ete. 


(Partial correlation coefficient—n variables. In these general- 
ized formulas for the partial correlation coefficient, the 
subscript 0 indicates the dependent variable and the sub- 
script » indicates the number of independent variables. 
The partial correlation coefficient always takes the sign of 
its corresponding beta coefficient. Although o:..; ~ f10.23, 
we will find that 701.2; = 110.23; this is because 


To1.23 = (Bo1.23 B10.23)* ) 
Tor.ese + «+ v= (1 — ko103... v/Ko0s4.-+v)* . . 5 [15a] 
Tenses sss ee C1 a i gaey ss ef/Maaassse)! «6 et [15b] 


02.13 +++ ¥ 


To(v-1).123 © +» v= (1 — Bo 103... v/ Ke o.103 « « « (v-2yv)# . : [15y] 
Tovrea + + + (vty == (1 — Fores... v/ Roses... ww-eyqv-ty)* =. S57] 
In addition to these fundamental formulas there are two possible 


checks on operations. Formula 16 should always be applied when final 
beta coefficients are computed. 


(Check on the calculation of the final beta coefficients—n vari- 
ables) 


To2 = Bore +. -v 112 + Bors. ++ v + Bose». sv Test 
Bosse eee v Nog - ise + Bov.12 eee (V1) Tov . . . . . [16] 


(Alternative calculation of the multiple correlation coefficient 
n variables) 


Bate ss sve ( Box.23 « »»v Tor + Boras. ++ To2+:: 
le Bov.r2 2 ++ (vay Tov)* ‘, ° i =. & ‘ ‘ ‘ [9b] 
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5. Comparisons 


These generalized formulas should enable the psychometrician 
to arrange schemas with which to solve his own problems. The great- 
est saving will, of course, be effected in those problems for which 
these new formulas were originally intended—the calculation of the 
beta coefficients, the multiple correlation coefficient, the standard er- 
ror of estimate, and the construction of the regression equation. In 
Table I we have compared the number of arithmetic processes re- 
quired to find Bo1.23459 Boz.18455 Bos.12455 Bos.12355 and Bos.1234 in a six-variable 
problem by Moore’s determinental method* with the number required 
by the writer’s new formulas. The saving effected by the latter is 
readily apparent. 

The new formulas, however, are also economical of time and ef- 
fort in solving partial correlation problems. For example, if we wish 
merely to obtain the partial correlation coefficients without the other 
constants, we shall find that for the same number of significant fig- 


TABLE I 


COMPARISON OF THE NUMBER OF OPERATIONS REQUIRED 
BY DETERMINANTS (MOORE’S METHOD) AND THE NEW 
FORMULAS IN SOLVING FOR 


Bo1.23455 Boz.13459 Bos.1245 Bos.12359 AND Bos.1234 


IN A SIX-VARIABLE PROBLEM 























Determinants | New Formulas 
Additions 24 : 6 
Subtractions 56 30 
| Multiplications 122 36 
| Divisions 15 10 
| Tables 7 pis a 














*Moore, Thomas Verner. Multiple Correlation and the Correlation between 
General Factors. Catholic University of America Studies in Psychology and 
Psychiatry, vol. III, no. 1. 
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ures retained in intermediate operations the new formulas will ob- 
tain the solutions with less calculations and with greater accuracy 
than will the older formulas of the type: 


" ana 11.23 + © © (v-1) —— Tov.23 + © (v-1) 11v.23 + © + (v-1) 
01.23 ** eV 
(1 — 1 ov.23 one (vy)? (1 — 1" v.93 eee (vy)! 





Table II contains a comparison of the number of operations required 
by the old and new methods in finding the partial correlation coef- 
ficients, 191.234, Y 02.134) 03.124 ANG 1'o4.123, in a five-variable problem. Again 
the efficiency of these new formulas is demonstrated. 


TABLE II 


COMPARISON OF THE NUMBER OF OPERATIONS REQUIRED 
BY THE OLD AND THE NEW FORMULAS IN SOLVING FOR 


101.2349 102.134) 103.124 ANA 1o4.123 


IN A FIVE-VARIABLE PROBLEM 





Old Formulas | New Formulas 
(Yule’s) (Griffin’s) 





| 
| 
| 


Additions | 0 0 








Subtractions 42 | 39 


Multiplications | 44 28 





Divisions 22 








Squares | 20 





Square Roots | 20 




















